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DESCRIPTION 

NUCLEIC ACID TREATMENT OF DISEASES OR CONDITIONS RELATED TO 
LEVELS OF RAS, HER2 AND HIV 

This patent application claims priority from McSwiggen USSN 60/294,140, filed May 
5 29, 2001, entitled " Enzymatic Nucleic Acid Treatment of Diseases or Conditions Related To 
Levels of HIV" McSwiggen USSN 60/296,249 filed June 6, 2001, entitled "Enzymatic 
Nucleic Acid Treatment of Diseases or Conditions Related to Levels of HER2." and 
McSwiggen USSN 60/318,471, filed September 10, 2001, entitled " Enzymatic Nucleic Acid 
Treatment of diseases or Conditions Related to Levels of RAS ." Each of these applications is 
1 0 hereby incorporated by reference herein in its entirety including the drawings and tables. 



Technical Field Of The Invention 

The present invention relates to novel nucleic acid compounds and methods for the 
treatment or diagnosis of diseases or conditions related to levels of Ras gene expression, such 
15 as K-Ras, H-Ras, and/or N-Ras expression, HIV infection such as HlV-l^nd HER2 gene 
expression. 

Background Of The Invention 

Transformation is a cumulative process whereby normal control of cell growth and 
differentiation is interrupted, usually through the accumulation of mutations affecting the 

20 expression of genes that regulate cell growth and differentiation. 

The platelet derived growth factor (PDGF) system has served as a prototype for 
identification of substrates of the receptor tyrosine kinases. Certain enzymes become 
activated by the PDGF receptor kinase, including phospholipase C and phosphatidylinositol 3 f 
kinase, Ras guanosine triphosphate (GTPase) activating protein (GAP) and src-like tyrosine 

25 kinases. GAP regulates the function of the Ras protein by stimulating the GTPase activity of 
the 21 kD Ras protein. Barbacid, 56 Ann. Rev. Biochem. 779, 1987. Microinjection of 
oncogenically activated Ras into NIH 3T3 cells has been shown to induce DNA synthesis. 
Mutations that cause oncogenic activation of Ras lead to accumulation of Ras bound to GTP, 
the active form of the molecule. These mutations block the ability of GAP to convert Ras to 

30 the inactive form. Mutations that impair the interactions of Ras with GAP also block the 
biological function of Ras. 
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While a number of Ras alleles exist (N-Ras, K-Ras, H-Ras) which have been 
implicated in carcinogenesis, the type most often associated with colon and pancreatic 
carcinomas is K-Ras. Enzymatic nucleic acid molecules which are targeted to certain regions 
of the K-Ras allelic mRNAs may also prove inhibitory to the function of the other allelic 
5 mRNAs of the N-Ras and H-Ras genes. 

Scanlon, International PCT Publication Nos. WO 91/18625, WO 91/18624, and WO 
91/18913 describes a ribozyme effective to cleave oncogene RNA from the H-Ras gene. This 
ribozyme is said to inhibit H-ras expression in response to exogenous stimuli. Reddy 
WO92/00080 describes the use of ribozymes as therapeutic agents for leukemias, such as 
1 0 chronic myelogenous leukemia (CML) by targeting specific portions of the BCR-ABL gene 
transcript, 

Thompson et al, International PCT publication No. WO 99/54459, describe nucleic 
acid molecules that modulate gene expression, including Ras gene expression. 

Zhang et al t 2000, Gene Ther., 7, 2041; Talcunaga et al, 2000, Br. J. Cancer., 83, 833; 

15 Zhang et al, 2000, Mol BiotechnoL, 15, 39; Irie et al, 2000, Mol Urol 4, 61; Kijima and 
Scanlon, 2000, Mol Biotechnol, 14, 59; Funato et al, 2000, Cancer Gene Ther., 7, 495; 
Tsuchida et al, 2000, Cancer Gene Ther., 7, 373; Zhang et al, 2000, Methods Mol Med., 35, 
261; Irie et al, 1999, Antisense Nucleic Acid Drug Dev., 9, 341; Giannini et al, 1999, 
Nucleic Acids Res., 27, 2737; Fang et al, 1999, 1 Med. Coll PLA, 14, 25; Tong et al, 1998, 

20 Methods Mol Med, 11, 209; Ohkawa and Kashani-Sabet, 1998, Methods Mol Med, 11, 153; 
Scherr et al, 1999, Gene Titer., 6, 152; Tsuchida et al, 1998, Biochem. Biophys. Res. 
Commun., 252, 368; Scherr et al, 1998, Gene Ther., 5, 1227; Uhlmann et al, European 
Patent Application EP 808898; Scherr et al, 1997, J. Biol Chem., 272, 14304; Chang et al, 
1997, J. Cancer Res. Clin. Oncol, 123, 91; Ohta et al, 1996, Nucleic Acids Res., 24, 938; 

25 Ohta et al, 1994, Ann. NY. Acad. Scl, 716, 242; and Funato et al, 1994, Biochem. 
Pharmacol, 48, 1471 all describe specific ribozymes targeting certain K-Ras, H-Ras, or N- 
Ras RNA sequences. 

Todd, International PCT Publication Nos. WO 01/49877, WO 99/50452, and WO 
99/45146 describes specific DNAzymes targeting K-Ras for diagnostic applications. 
JO Acquired immunodeficiency syndrome (ADDS) is thought to be caused by infection 

with the human immunodeficiency virus, for example HTV-1. Draper et al, U.S. Patent Nos. 
6,159,692, 5,972,704, 5,693,535, and International PCT Publication Nos. WO WO 93/23569, 
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WO 95/04818, describe enzymatic nucleic acid molecules targeting HIV. Todd et al, 
International PCT Publication No. WO 99/50452, describe methods for using specific 
DNAzyme motifs for detecting the presence of certain HTV RNAs. Sriram and Banerjea, 
2000, Biochem J. f 352, 667-673, describe specific RNA cleaving DNA enzymes targeting 
5 HIV-1. Zhang et al, 1999, FEES Lett, 458, 151-156, describe specific RNA cleaving DNA 
enzymes used in the inhibition of HIV-1 infection. 

HER2 (also known as neu, erbB2 and c-erbB2) is an oncogene that encodes a 185-kDa 
transmembrane tyrosine kinase receptor. HER2 is a member of the epidermal growth factor 
receptor (EGFR) family and shares partial homology with other family members. In normal 

10 adult tissues HER2 expression is low. However, HER2 is overexpressed in at least 25-30% 
of breast (McGuire, H.C. and Greene, M.I. (1989) The neu (c-erbB-2) oncogene. Semin. 
Oncol. 16: 148-155) and ovarian cancers (Berchuck, A. Kamel, A., Whitaker, R. et al 
(1990)). Overexpression of her-2/neu is associated with poor survival in advanced epithelial 
ovarian cancer. Cancer Research 50: 4087-4091). Furthermore, overexpression of HER2 in 

1 5 malignant breast tumors has been correlated with increased metastasis, chemoresistance and 
poor survival rates (Slamon et al., 1987 Science 235: 177-182). Because HER2 expression is 
high in aggressive human breast and ovarian cancers, but low in normal adult tissues, it is an 
attractive target for enzymatic nucleic acid-mediated therapy. McSwiggen et al, International 
PCT Publication No. WO 01/16312 and Beigelman et al, International PCT Publication No. 

20 WO 99/55857 describe enzymatic nucleic acid molecules targeting HER2. Thompson and 
Draper, US Patent No. 5,599,704, describes enzymatic nucleic acid molecules targeting 
HER2 (erbB2/neu) gene expression. 

Summary Of The Invention 

The present invention features nucleic acid molecules, including, for example, antisense 
15 oligonucleotides, siRNA, aptamers, decoys and enzymatic nucleic acid molecules such as 
DNAzyme enzymatic nucleic acid molecules, which modulate expression of nucleic acid 
molecules encoding Ras oncogenes, such as K-Ras, H-Ras, and N-Ras. In one embodiment, 
the invention features an enzymatic nucleic acid molecule comprising a sequence selected 
from the group consisting of SEQ ID NOs: 2329-4655. 
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In another embodiment, the invention features an enzymatic nucleic acid molecule 
comprising at least one binding arm having a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs: 1-2328. 

In another embodiment, the invention features a siRNA molecule having 
5 complementarity to a sequence selected from the group consisting of SEQ ID NOs: 1-2328. 

In another embodiment, the invention features an antisense molecule having 
complementarity to a sequence selected from the group consisting of SEQ ID NOs: 1-2328. 

In another aspect of the invention, the nucleic acid of the invention is adapted to treat 
cancer. 

10 In one embodiment, the enzymatic nucleic acid molecule of the invention has an 

endonuclease activity to cleave RNA having a K-Ras sequence. 

la another embodiment, the enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having an H-Ras sequence. 

In another embodiment, the enzymatic nucleic acid molecule of the invention has an 
1 5 endonuclease activity to cleave RNA having an N-Ras sequence. 

In one embodiment, the siRNA molecule of the invention has RNA interference activity 
to K-Ras expression. 

In another embodiment, the siRNA molecule of the invention has RNA interference 
activity to H-Ras expression. 

20 In another embodiment, the siRNA molecule of the invention has RNA interference 

activity to N-Ras expression. 

In one embodiment, a siRNA molecule of the invention comprises a double stranded 
RNA wherein one strand of the RNA is complementary to the RNA of K-Ras, H-Ras, and/or 
N-Ras gene, hi another embodiment, a siRNA molecule of the invention comprises a double 

25 stranded RNA wherein one strand of the RNA comprises a portion of a sequence of RNA of 
K-Ras, H-Ras, and/or N-Ras gene sequence. In yet another embodiment, a siRNA molecule 
of the invention comprises a double stranded RNA wherein both strands of RNA are 
connected by a non-nucleotide linker. Alternately, a siRNA molecule of the invention 
comprises a double stranded RNA wherein both strands of RNA are connected by a 

30 nucleotide linker, such as a loop or stem loop structure. 
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In one embodiment, a single strand component of a siRNA molecule of the invention is 
from about 14 to about 50 nucleotides in length. In another embodiment, a single strand 
component of a siRNA molecule of the invention is about 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, or 28 nucleotides in length. In yet another embodiment, a single strand 
5 component of a siRNA molecule of the invention is about 23 nucleotides in length. In one 
embodiment, a siRNA molecule of the invention is from about 28 to about 56 nucleotides in 
length. In another embodiment, a siRNA molecule of the invention is about 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, 50, 51, or 52 nucleotides in length. In yet another embodiment, a 
siRNA molecule of the invention is about 46 nucleotides in length. 

10 In one embodiment, the DNAzyme molecule of the invention is in a "10-23" 

configuration (see for example Santoro et al t 1997 PNAS, 94, 4262 and Joyce et al, US 
5,807,718). In another embodiment, the DNAzyme comprises a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NOs: 1-2328. In yet another 
embodiment, the DNAzyme comprises a sequence selected from the group consisting of SEQ 

15 ID NOs: 2329-4655. 

In another embodiment, the nucleic acid molecule of the invention comprises between 
12 and 100 bases complementary to a nucleic acid molecule having a K-Ras sequence. In yet 
another embodiment, the enzymatic nucleic acid comprises between 14 and 24 bases 
complementary to a nucleic acid molecule having a K-Ras sequence. 

20 In another embodiment, the nucleic acid molecule of the invention comprises between 

12 and 100 bases complementary to a nucleic acid molecule having an H-Ras sequence. In 
yet another embodiment, the nucleic acid molecule of the invention comprises between 14 
and 24 bases complementary to a nucleic acid molecule having an H-Ras sequence. 

In another embodiment, the nucleic acid molecule of the invention comprises between 
25 12 and 100 bases complementary to a nucleic acid molecule having an N-Ras sequence. In 
yet another embodiment, the nucleic acid molecule of the invention comprises between 14 
and 24 bases complementary to a nucleic acid molecule having an N-Ras sequence. 

In yet another embodiment, the nucleic acid molecule of the invention is chemically 
synthesized. The nucleic acid molecule can comprise at least one 2'-sugar modification, at 
30 least one nucleic acid base modification, and/or at least one phosphate backbone 
modification. 
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In one embodiment, the invention features a mammalian cell comprising the nucleic 
acid molecule of the invention. In another embodiment, the mammalian cell of the invention 
is a human cell. 

In another embodiment, the invention features a method of modulating K-Ras activity 
5 in a cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the modulation of K-Ras activity. 

In another embodiment, the invention features a method of modulating H-Ras activity 
in a cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the modulation of H-Ras activity. 

10 In another embodiment, the invention features a method of modulating N-Ras activity 

in a cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the modulation of N-Ras activity. 

In another embodiment, the invention features a method of treatment of a subject 
having a condition associated with the level of K-Ras, comprising contacting cells of the 
15 subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment. 

In another embodiment, the invention features a method of treatment of a subject 
having a condition associated with the level of H-Ras, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
20 treatment. 

In another embodiment, the invention features a method of treatment of a subject 
having a condition associated with the level of N-Ras, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment. 

25 hi one embodiment, a method of treatment of the invention further comprises the use of 

one or more drug therapies under conditions suitable for the treatment. 

In another embodiment, the invention features a method of cleaving RNA having a K- 
Ras sequence comprising contacting the K-Ras RNA with the enzymatic nucleic acid 
molecule of the invention under conditions suitable for the cleavage, for example, where the 
30 cleavage is carried out in the presence of a divalent cation, such as Mg2+. 
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In another embodiment, the invention features a method of cleaving RNA having a H- 
Ras sequence comprising contacting the H-Ras RNA with the enzymatic nucleic acid 
molecule of the invention under conditions suitable for the cleavage, for example, where the 
cleavage is carried out in the presence of a divalent cation, such as Mg2+. 

5 In another embodiment, the invention features a method of cleaving RNA having an N- 

Ras sequence comprising contacting the N-Ras RNA with the enzymatic nucleic acid 
molecule of the invention under conditions suitable for the cleavage, for example, where the 
cleavage is carried out in the presence of a divalent cation, such as Mg2+. 

In one embodiment, the nucleic acid molecule of the invention comprises a cap 
10 structure, for example, a 3 ',3 '-linked or 5 \5 '-linked deoxyabasic ribose derivative, wherein 
the cap structure is at the 5 '-end, 3 '-end, or both the 5 '-end and the 3 '-end of the nucleic acid 
molecule. 

In another embodiment, the invention features an expression vector comprising a 
nucleic acid sequence encoding at least one nucleic acid molecule of the invention in a 
1 5 manner that allows expression of the nucleic acid molecule. For example, the invention 
features an expression vector comprising a nucleic acid encoding a DNAzyme in a manner 
that allows expression of the DNAzyme. 

In yet another embodiment, the invention features a mammalian cell, for example a 
human cell, comprising an expression vector of the invention. 

20 In another embodiment, the expression vector of the invention further comprises a 

sequence for a nucleic acid molecule complementary to an RNA having K-Ras sequence. 

In another embodiment, the expression vector of the invention further comprises a 
sequence for a nucleic acid molecule complementary to an RNA having H-Ras sequence. 

In another embodiment, the expression vector of the invention further comprises a 
25 sequence for a nucleic acid molecule complementary to an RNA having N-Ras sequence. 

In one embodiment, an expression vector of the invention comprises a nucleic acid 
sequence encoding two or more nucleic acid molecules of the invention, which can be the 
same or different. In another embodiment, an expression vector of the invention further 
comprises a sequence encoding ah antisense nucleic acid molecule complementary to an RNA 
30 having a K-Ras, H-Ras or N-Ras sequence. 
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In another embodiment, the invention features a method for treating cancer, for 
example colorectal cancer, bladder cancer, lung cancer, pancreatic cancer, breast cancer, or 
prostate cancer, comprising administering to a subject a nucleic acid molecule of the 
invention under conditions suitable for the treatment. A method of treatment of cancer of the 
5 invention can further comprise administering to a patient one or more other therapies, for 
example, monoclonal antibody therapy, such as Herceptin (trastuzumab); chemotherapy, such 
as paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, cyclophosphamide, doxorubin, 
fluorouracil carboplatin, Leucovorin, Mnotecan (CAMPTOSAR® or CPT-11 or 
Camptothecin-11 or Campto), Carboplatin, edatrexate, gemcitabine, or vinorelbine; radiation 
1 0 therapy, or analgesic therapy and/or any combination thereof 

In another embodiment, the invention features a composition comprising a nucleic acid 
molecule of the invention in a pharmaceutically acceptable carrier. 

In one embodiment, the invention features a method of administering to a cell, for 
example a mammalian cell or human cell, the nucleic acid molecule of the invention 
1 5 comprising contacting the cell with the nucleic acid molecule under conditions suitable for 
administration. The method of administration can be in the presence of a delivery reagent, for 
example a lipid, cationic lipid, phospholipid, or liposome. 

The present invention features an enzymatic nucleic acid molecule which modulates 
expression of a nucleic acid molecule encoding a human immunodeficiency virus (HIV), for 
20 example HIV-1, HIV-2, and related viruses such as FIV-l and SIV-l, or a HIV gene, for 
example LTR, nef, vif, tat, or rev, wherein the enzymatic nucleic acid molecule comprises a 
DNAzyme configuration. 

The invention also features an enzymatic nucleic acid molecule which modulates 
expression of a nucleic acid molecule encoding HIV or a component of HTV such as net, vif, 
25 tat, or rev, wherein the enzymatic nucleic acid molecule is in a Inozyme, G-cleaver, Zinzyme, 
DNAzyme or Amberzyme configuration. 

The present invention also features a siRNA molecule which modulates expression of a 
nucleic acid molecule encoding a human immunodeficiency virus (HIV), for example HIV-1, 
HTV-2, and related viruses such as FIV-l and SIV-1, or a HIV gene, for example LTR, nef, 
30 vif, tat, or rev. 

The present invention features an enzymatic nucleic acid molecule comprising a 
sequence selected from the group consisting of SEQ ID NOs. 6727-6799. The invention also 
features an enzymatic nucleic acid molecule comprising at least one binding arm wherein one 
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or more of said binding arms comprises a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs. 6642-6726. In addition, the present invention 
features a siRNA nucleic acid molecule comprising sequence complementary to a sequence 
selected from the group consisting of SEQ ID NOs. 1-76 and 140-148. 

5 In another embodiment, the siRNA molecule of the invention has RNA interference 

activity to HIV-1 expression and/or replication. 

In one embodiment, a siRNA molecule of the invention comprises a double stranded 
RNA wherein one strand of the RNA is complementary to the RNA of HIV-1 genome or 
genes. In another embodiment, a siRNA molecule of the invention comprises a double 

1 0 stranded RNA wherein one strand of the RNA comprises a portion of a sequence of HIV-1 
genome or gene sequence. In yet another embodiment, a siRNA molecule of the invention 
comprises a double stranded RNA wherein both strands of RNA are connected by a non- 
nucleotide linker. Alternately, a siRNA molecule of the invention comprises a double 
stranded RNA wherein both strands of RNA are connected by a nucleotide linker, such as a 

1 5 loop or stem loop structure. 

In one embodiment, a single strand component of a siRNA molecule of the invention is 
from about 14 to about 50 nucleotides in length. In another embodiment, a single strand 
component of a siRNA molecule of the invention is about 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, or 28 nucleotides in length. In yet another embodiment, a single strand 

20 component of a siRNA molecule of the invention is about 23 nucleotides in length. In one 
embodiment, a siRNA molecule of the invention is from about 28 to about 56 nucleotides in 
length. In another embodiment, a siRNA molecule of the invention is about 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, 50, 51, or 52 nucleotides in length. In yet another embodiment, a 
siRNA molecule of the invention is about 46 nucleotides in length. 

25 In one embodiment, a nucleic acid molecule of the invention is adapted to treat HTV 

infection or acquired immunodeficiency syndrome (AIDS). 

In another embodiment, the enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having HTV sequence. 

In yet another embodiment, the enzymatic nucleic acid molecule of the invention is in 

30 an Inozyme, Zinzyme, G-cleaver, Amberzyme, DNAzyme or Hammerhead configuration. 
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In another embodiment, the Inozyme of the invention comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6648-6655, 
or comprises a sequence selected from the group consisting of SEQ ID NOs. 6733-6740. 

In another embodiment, the Zinzyme of the invention comprises a sequence 
5 complementary to a sequence selected from the group consisting of SEQ ID NOs. 6656-6663 
and 6723-6726, or comprises a sequence selected from the group consisting of SEQ ID NOs 
6741-6748 and 6795-6799. 

In another embodiment, the Amberzyme of the invention comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6656-6688, 
10 or comprises a sequence selected from the group consisting of SEQ ID NOs. 6762-6789. 

In another embodiment, the DNAzyme of the invention comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6656-6668 
and 6718-6722, or comprises a sequence selected from the group consisting of SEQ ID NOs. 
6749-6761 and 6790-6794. 
15 In another embodiment, the Hammerhead of the invention comprises a sequence 

complementary to a sequence selected from the group consisting of SEQ ID NOs. 6642-6647, 
or comprises a sequence selected from the group consisting of SEQ ID NOs 6727-6732. 

In one embodiment, a nucleic acid molecule of the invention comprises between 12 and 
100 bases complementary to a RNA sequence encoding HIV genome, RNA, and/or proteins. 
20 In another embodiment, a nucleic acid molecule of the invention comprises between 14 and 
24 bases complementary to a RNA sequence encoding HIV genome, RNA, and/or proteins. 

In yet another embodiment, a nucleic acid molecule of the invention is chemically 
synthesized. A nucleic acid molecule of the invention can comprise at least one 2 '-sugar 
modification, at least one nucleic acid base modification, and/or at least one phosphate 
25 backbone modification. 

The present invention features a mammalian cell including a nucleic acid molecule of 
the invention. In one embodiment, the mammalian cell of the invention is a human cell. 

The invention features a method of reducing HTV activity in a cell, comprising 
contacting the cell with a nucleic acid molecule of the invention, under conditions suitable for 
30 the reduction of HIV activity. 
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The invention also features a method of treating a subject having a condition associated 
with the level of HIV, comprising contacting cells of the subject with a nucleic acid molecule 
of the invention, under conditions suitable for the treatment. 

In one embodiment, methods of treatment contemplated by the invention comprise the 
5 use of one or more drug therapies under conditions suitable for the treatment. 

The invention features a method of cleaving RNA comprising a HIV nucleic acid 
sequence comprising contacting an enzymatic nucleic acid molecule of the invention with the 
RNA under conditions suitable for the cleavage. In one embodiment, the cleavage 
contemplated by the invention is carried out in the presence of a divalent cation, for example 
10 Mg 2+ . 

The present invention features a method for treatment of acquired immunodeficiency 
syndrome (AIDS) or an ADDS related condition, for example Kaposi's sarcoma, lymphoma, 
cervical cancer, squamous cell carcinoma, cardiac myopathy, rheumatic disease, or 
opportunistic infection, comprising administering to a subject a nucleic acid molecule of the 
1 5 invention under conditions suitable for the treatment. 

In one embodiment, nucleic acid molecule of the invention comprises at least five 
ribose residues, at least ten 2-O-methyl modifications, and a 3'- end modification, for 
example a 3 '-3' inverted abasic moiety. 

In another embodiment, a nucleic acid molecule of the invention further comprises 
20 phosphorothioate linkages on at least three of the 5' terminal nucleotides. 

In yet another embodiment, a DNAzyme of the invention comprises at least ten 2'-0- 
methyl modifications and a 3 '-end modification, for example a 3 '-3' inverted abasic moiety. 
In a further embodiment, the DNAzyme of the invention further comprises phosphorothioate 
linkages on at least three of the 5* terminal nucleotides. 
25 In another embodiment, other drug therapies of the invention comprise antiviral 

therapy, monoclonal antibody therapy, chemotherapy, radiation therapy, analgesic therapy, or 
anti-inflammatory therapy. 

In yet another embodiment, antiviral therapy of the invention comprises treatment with 
AZT, ddC, ddl, d4T, 3TC, Ribavirin, delvaridine, nevirapine, efravirenz, ritonavir, saquinivir, 
30 indinavir, amprenivir, nelfinavir, or lopinavir. 

The invention features a composition comprising a nucleic acid molecule of the 
invention in a pharmaceutically acceptable carrier. 
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In one embodiment, the invention features a method of administering to a cell, for 
example a mammalian cell or human cell, an enzymatic nucleic acid molecule of the 
invention comprising contacting the cell with the enzymatic nucleic acid molecule under 
conditions suitable for the administration. The method of administration can be in the 
5 presence of a delivery reagent, for example a lipid, cationic lipid, phospholipid, or liposome. 

The present invention features enzymatic nucleic acid molecules which modulate 
expression of nucleic acid molecules encoding HER2. The present invention also features 
siRNA molecules which modulate the expression of nucleic acid molecules encoding HER2. 

In another embodiment, the invention features a siRNA molecule having 
10 complementarity to a sequence selected from the group consisting of SEQ ID NOs: 4656- 
5643 and 6632-6636. 

In one embodiment, the invention features an enzymatic nucleic acid molecule 
comprising a sequence selected from the group consisting of SEQ ID NOs: 5644-6631 and 
6637-6641. 

15 In another embodiment, the invention features an enzymatic nucleic acid molecule 

comprising at least one binding arm having a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs: 4656-5643 and 6632-6636. 

In yet another embodiment, a nucleic acid of the invention is adapted to treat cancer. 

In another embodiment, an enzymatic nucleic acid molecule of the invention has an 
20 endonuclease activity to cleave RNA having HER2 sequence. 

In another embodiment, the siRNA molecule of the invention has RNA interference 
activity to N-Ras gene expression. 

In one embodiment, a siRNA molecule of the invention comprises a double stranded 
RNA wherein one strand of the RNA is complementary to the RNA of HER2 gene. In 

25 another embodiment, a siRNA molecule of the invention comprises a double stranded RNA 
wherein one strand of the RNA comprises a portion of a sequence of RNA having of HER2 
gene sequence. In yet another embodiment, a siRNA molecule of the invention comprises a 
double stranded RNA wherein both strands of RNA are connected by a non-nucleotide linker. 
Alternately, a siRNA molecule of the invention comprises a double stranded RNA wherein 

30 both strands of RNA are connected by a nucleotide linker, such as a loop or stem loop 
structure. 
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In one embodiment, a single strand component of a siRNA molecule of the invention is 
from about 14 to about 50 nucleotides in length. In another embodiment, a single strand 
component of a siRNA molecule of the invention is about 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, or 28 nucleotides in length. In yet another embodiment, a single strand 
5 component of a siRNA molecule of the invention is about 23 nucleotides in length. In one 
embodiment, a siRNA molecule of the invention is from about 28 to about 56 nucleotides in 
length. In another embodiment, a siRNA molecule of the invention is about 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, 50, 51, or 52 nucleotides in length. In yet another embodiment, a 
siRNA molecule of the invention is about 46 nucleotides in length. 

10 In one embodiment, a DNAzyme molecule of the invention is in a "10-23" 

configuration. In another embodiment, a DNAzyme of the invention comprises a sequence 
complementary to a sequence having SEQ ID NOs: 4656-5643 and 6632-6636. In yet another 
embodiment, a DNAzyme molecule of the invention comprises a sequence having SEQ ID 
NOs: 5644-6631 and 6637-6641. 

15 In another embodiment, a nucleic acid molecule of the invention comprises between 12 

and 100 bases complementary to a nucleic acid molecule having HER2 sequence. In yet 
another embodiment, a nucleic acid molecule of the invention comprises between 14 and 24 
bases complementary to a nucleic acid molecule having HER2 sequence. 

In yet another embodiment, a nucleic acid molecule of the invention is chemically 
20 synthesized. A nucleic acid molecule of the invention can comprise at least one 2'-sugar 
modification, at least one nucleic acid base modification, and/or at least one phosphate 
backbone modification. 

In one embodiment, the invention features a mammalian cell comprising a nucleic acid 
molecule of the invention. In another embodiment, the mammalian cell of the invention is a 
25 human cell. 

In another embodiment, the invention features a method of reducing HER2 activity in a 
cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the reduction of HER2 activity. 

In another embodiment, the invention features a method of treatment of a subject 
30 having a condition associated with the level of HER2, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment. 
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In one embodiment, a method of treatment of the invention further comprises the use of 
one or more drug therapies under conditions suitable for the treatment. 

In another embodiment, the invention features a method of cleaving RNA having HER2 
sequence comprising contacting an enzymatic nucleic acid molecule of the invention with the 
5 RNA under conditions suitable for the cleavage, for example, where the cleavage is carried 
out in the presence of a divalent cation, such as Mg2+. 

In one embodiment, a nucleic acid molecule of the invention comprises a cap structure, 
for example a 3 ',3Minked or 5\ 5 '-linked deoxyabasic ribose derivative, wherein the cap 
structure is at the 5'-end, 3'-end, or both the 5'-end and the 3'-end of the enzymatic nucleic 
10 acid molecule. 

In another embodiment, the invention features an expression vector comprising a 
nucleic acid sequence encoding at least one nucleic acid molecule of the invention, for 
example a DNAzyme or siRNA molecule, in a manner that allows expression of the nucleic 
acid molecule. 

15 In yet another embodiment, the invention features a mammalian cell, for example a 

human cell, comprising an expression vector of the invention. 

In another embodiment, an expression vector of the invention further comprises a 
sequence for a nucleic acid molecule complementary to a nucleic acid molecule having HER2 
sequence. 

20 In one embodiment, an expression vector of the invention comprises a nucleic acid 

sequence encoding two or more nucleic acid molecules, which can be the same or different. In 
another embodiment, an expression vector of the invention further comprises a sequence 
encoding an antisense nucleic acid molecule complementary to a nucleic acid molecule 
having a HER2 sequence. 

25 In another embodiment, the invention features a method for treating cancer, for 

example breast cancer or ovarian cancer, comprising administering to a subject a nucleic acid 
molecule of the invention under conditions suitable for the treatment. A method of treatment 
of cancer of the invention can further comprise administering to a patient one or more other 
therapies, for example, monoclonal antibody therapy, such as Herceptin (trastuzumab); 

30 chemotherapy, such as paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, Leucovorin, Irinotecan 
(CAMPTOSAR® or CPT-11 or Camptothecin-11 or Campto), Carboplatin, edatrexate, 
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gemcitabine, or vinorelbine; radiation therapy, or analgesic therapy and/or any combination 
thereof. 

In another embodiment, the invention features a composition comprising a nucleic acid 
molecule of the invention in a pharmaceutically acceptable carrier. 
5 hi one embodiment, the invention features a method of administering to a cell, for 

example a mammalian cell or human cell, a nucleic acid molecule of the invention 
comprising contacting the cell with the nucleic acid molecule under conditions suitable for 
administration. The method of administration can be in the presence of a delivery reagent, for 
example a lipid, cationic lipid, phospholipid, or liposome. 

10 

Detailed Description of the Invention 
First the drawings will be described briefly. 
Drawings 

Figure 1 shows examples of chemically stabilized ribozyme motifs. HH Rz, represents 
15 hammerhead ribozyme motif (Usman et al, 1996, Curr. Op. Struct. Bio., 1, 527); NCH Rz 
represents the NCH ribozyme motif (Ludwig et al, International PCT Publication No. WO 
98/58058 and US Patent Application Serial No. 08/878,640); G-Cleaver, represents G- 
cleaver ribozyme motif (Kore et al, 1998, Nucleic Acids Research 26, 41 16-4120, Eckstein et 
al, US 6,127,173). N or n, represent independently a nucleotide which can be same or 
20 different and have complementarity to each other; rl, represents ribo-Inosine nucleotide; 
arrow indicates the site of cleavage within the target. Position 4 of the HH Rz and the NCH 
Rz is shown as having 2'-C-allyl modification, but those skilled in the art will recognize that 
this position can be modified with other modifications well known in the art, so long as such 
modifications do not significantly inhibit the activity of the ribozyme. 

25 Figure 2 shows an example of the Amberzyme ribozyme motif that is chemically 

stabilized (see for example Beigelman et al, International PCT publication No. WO 
99/55857 and US Patent Application Serial No. 09/476,387.). 

Figure 3 shows an example of a Zinzyme A ribozyme motif that is chemically 
stabilized (see for example Beigelman et al, Internationa] PCT publication No. WO 
30 99/55857 and US Patent Application Serial No. 09/918,728). 
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Figure 4 shows an example of a DNAzyme motif described by Santoro et ai, 1997, 
PNAS, 94, 4262 and Joyce et al. t US 5,807,718 . 

The invention features novel nucleic acid molecules, including antisense 
oligonucleotides, siRNA and enzymatic nucleic acid molecules, and methods to modulate 
5 gene expression, for example, genes encoding K-Ras, H-Ras and/or N-Ras. In particular, the 
instant invention features nucleic-acid based molecules and methods to down-regulate the 
expression of K-Ras, H-Ras and/or N-Ras gene sequences. 

The invention features one or more nucleic acid-based molecules and methods that 
independently or in combination modulate the expression of a gene or genes encoding Ras 
1 0 proteins. In particular embodiments, the invention features nucleic acid-based molecules and 
methods that modulate the expression of K-Ras gene, for example, Genbank Accession No. 
NM_004985; H-Ras gene, for example, Genbank Accession No. NM_005343; and/or N-Ras 
gene, for example, Genbank Accession No. NM_002524. 

The description below of the various aspects and embodiments is provided with 
1 5 reference to exemplary K-Ras, H-Ras, and N-Ras genes, referred to hereinafter collectively as 
Ras. However, the various aspects and embodiments are directed to equivalent sequences and 
also to other genes which encode K-Ras, H-Ras and/or N-Ras proteins and similar proteins to 
K-Ras, H-Ras and/or N-Ras. For example, the invention relates to genes with homology to 
genes that encode K-Ras, H-Ras and/or N-Ras and genes that encode proteins with similar 
20 function to K-Ras, H-Ras, and N-Ras proteins. Those additional genes can be analyzed for 
target sites using the methods described herein. Thus, the modulation and the effects of such 
modulation of the other genes can be determined as described herein. 

In one embodiment, the invention features the use of an enzymatic nucleic acid 
molecule, preferably in the hammerhead, NCH, G-cleaver, amberzyme, zinzyme and/or 
25 DNAzyme motif, to modulate the expression of a Ras gene or inhibit Ras activity. In one 
embodiment, the invention features the use of these enzymatic nucleic acid molecules to 
down-regulate the expression of a Ras gene or inhibit Ras activity. In another embodiment, 
the invention features the use of an antisense oligonucleotide molecule to modulate, for 
example, down-regulate, the expression of a Ras gene or inhibit Ras activity. 

30 The invention features novel enzymatic nucleic acid molecules, siRNA molecules, and 

methods to modulate expression and/or activity of human immunodeficiency virus (HIV), for 
example HIV-1, HIV-2, and related viruses such as FIV-1 and SIV-1, or a HIV gene, for 
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example LTR, nef, vifi tat, or rev. In particular, the instant invention features nucleic-acid 
based molecules and methods to inhibit the replication of a HIV or related virus. 

The invention features one or more nucleic acid-based molecules and methods that 
independently or in combination modulate the expression of gene(s) encoded by HIV and/or 
5 inhibit the replication of HIV. In particular embodiments, the invention features nucleic acid- 
based molecules and methods that modulate the expression of HIV-1 encoded genes, for 
example (Genbank Accession No. AJ302647); HIV-2 gene, for example (Genbank Accession 
No. NCJ)01722), FIV-1, for example (Genbank Accession No. NCJXM482), SIV-1, for 
example (Genbank Accession No. M66437), LTR, for example included in (Genbank 
1 0 Accession No. AJ302647), nef 9 for example included in (Genbank Accession No. AJ302647), 
vif, for example included in (Genbank Accession No. AJ302647), tat, for example included in 
(Genbank Accession No. AJ302647), and rev, for example included in (Genbank Accession 
No. AJ302647). 

The description below of the various aspects and embodiments is provided with 
15 reference to the exemplary HTV-1 gene, referred to herein as HTV. However, the various 
aspects and embodiments are also directed to other genes which encode HTV proteins and 
similar viruses to HIV. Those additional genes can be analyzed for target sites using the 
methods described for HIV. Thus, the inhibition and the effects of such inhibition of the 
other genes can be performed as described herein. 

20 Due to the high sequence variability of the HIV genome, selection of nucleic acid 

molecules for broad therapeutic applications would likely involve the conserved regions of 
the HTV genome. Specifically, the present invention describes nucleic acid molecules that 
cleave the conserved regions of the HIV genome. Therefore, one nucleic acid molecule can 
be designed to cleave all the different isolates of HTV. Nucleic acid molecules designed 

25 against conserved regions of various HTV isolates can enable efficient inhibition of HIV 
replication in diverse subject populations and can ensure the effectiveness of the nucleic acid 
molecules against HTV quasi species which evolve due to mutations in the non-conserved 
regions of the HTV genome. 

In one embodiment, the invention features the use of an enzymatic nucleic acid 
30 molecule, preferably in the hammerhead, NCH, G-cleaver, amberzyme, zinzyme and/or 
DNAzyme motif, to down-regulate the expression of HTV genes or inhibit the replication of 
HIV. 
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The invention features novel nucleic acid molecules, siRNA molecules and methods to 
modulate gene expression, for example, genes encoding HER2. In particular, the instant 
invention features nucleic-acid based molecules and methods to inhibit the expression of 
HER2. 

5 The invention features one or more nucleic acid-based molecules and methods that 

independently or in combination modulate the expression of a gene or genes encoding HER2. 
In particular embodiments, the invention features nucleic acid-based molecules and methods 
that modulate the expression of HER2 gene, for example, Genbank Accession No. 
NM_004448. 

10 The description below of the various aspects and embodiments is provided with 

reference to an exemplary HER2 gene, referred to herein as HER2 but also known as ERB2, 
ERB-B2, NEU, NGL, and v-ERB-62. However, the various aspects and embodiments are 
also directed to other genes which encode HER2 proteins and similar proteins to HER2. 
Those additional genes can be analyzed for target sites using the methods described for 

1 5 HER2. Thus, the inhibition and the effects of such inhibition of the other genes can be 
performed as described herein. 

In one embodiment, the invention features the use of an enzymatic nucleic acid 
molecule, preferably in the hammerhead, NCH, G-cleaver, amberzyme, zinzyme and/or 
DNAzyme motif, to down-regulate the expression of HER2 genes or inhibit HER2 activity. 

20 By "modulate" is meant that the expression of the gene, or level of RNAs or 

equivalent RNAs encoding one or more protein subunits or components, or activity of one or 
more proteins is up-regulated or down-regulated, such that the expression, level, or activity is 
greater than or less than that observed in the absence of the nucleic acid molecules of the 
invention. 

25 By "inhibit" or "down-regulate" it is meant that the expression of the gene, or level of 

RNAs or equivalent RNAs encoding one or more protein subunits or components, or activity 
of one or more protein subunits or components, such as Ras, HIV, and/or HER2 protein or 
proteins, is reduced below that observed in the absence of the nucleic acid molecules of the 
invention. In one embodiment, inhibition or down-regulation with the enzymatic nucleic acid 

30 molecule preferably is below that level observed in the presence of an enzymatically inactive 
or attenuated enzymatic nucleic acid molecule that is able to bind to the same site on the 
target RNA, but is unable to cleave that RNA. In another embodiment, inhibition or down- 
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regulation with an antisense oligonucleotide is preferably below that level observed in the 
presence of, for example, an oligonucleotide with scrambled sequence or with mismatches. In 
another embodiment, inhibition or down-regulation with an siRNA molecule is preferably 
below that level observed in the presence of, for example, an oligonucleotide with scrambled 
5 sequence or with mismatches. In another embodiment, inhibition or down-regulation of Ras, 
HTV, or HER2 expression and/or activity with the nucleic acid molecule of the instant 
invention is greater in the presence of the nucleic acid molecule than in its absence. 

By "up-regulate" is meant that the expression of the gene, or level of RNAs or 
equivalent RNAs encoding one or more protein subunits or components, or activity of one or 
10 more protein subunits or components, such as Ras, HIV, or HER2 protein or proteins, is 
greater than that observed in the absence of the nucleic acid molecules of the invention. For 
example, the expression of a gene, such as Ras, HIV, or HER2 gene, can be increased in order 
to treat, prevent, ameliorate, or modulate a pathological condition caused or exacerbated by 
an absence or low level of gene expression. 

15 By "enzymatic nucleic acid molecule" as used herein, is meant a nucleic acid molecule 

which has complementarity in a substrate binding region to a specified gene target, and also 
has an enzymatic activity which is active to specifically cleave target RNA. That is, the 
enzymatic nucleic acid molecule is able to intermolecularly cleave RNA and thereby 
inactivate a target RNA molecule. These complementary regions allow sufficient 

20 hybridization of the enzymatic nucleic acid molecule to the target RNA and thus permit 
cleavage. One hundred percent complementarity is preferred, but complementarity as low as 
50-75% can also be useful in this invention (see for example Werner and Uhlenbeck, 1995, 
Nucleic Acids Research, 23, 2092-2096; Hammann et al, 1999, Antisense and Nucleic Acid 
Drug Dev., 9, 25-31). The nucleic acids can be modified at the base, sugar, and/or phosphate 

25 groups. The term DNAzyme-based enzymatic nucleic acid is used interchangeably with 
phrases such as catalytic DNA, aptazyme or aptamer-binding DNAzyme, regulatable 
DNAzyme, catalytic oligonucleotides, nucleozyme, DNAzyme, endoribonuclease, 
endonuclease, minizyme, leadzyme, oligozyme or DNA enzyme. All of these terminologies 
describe nucleic acid molecules with enzymatic activity. The specific enzymatic nucleic acid 

30 molecules described in the instant application are not limiting in the invention and those 
skilled in the art will recognize that all that is important in an enzymatic nucleic acid 
molecule of this invention is that it have a specific substrate binding site which is 
complementary to one or more of the target nucleic acid regions, and that it have nucleotide 
sequences within or surrounding that substrate binding site which impart a nucleic acid 

35 cleaving and/or ligation activity to the molecule. 
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By "nucleic acid molecule" as used herein is meant a molecule having nucleotides. The 
nucleic acid can be single, double, or multiple stranded and can comprise modified or 
unmodified nucleotides or non-nucleotides or various mixtures and combinations thereof. 

By "enzymatic portion" or "catalytic domain" is meant that portion/region of the 
5 enzymatic nucleic acid molecule essential for cleavage of a nucleic acid substrate (for 
example see Figures 1-4). 

By "substrate binding arm" or "substrate binding domain" is meant that portion/region 
of a enzymatic nucleic acid which is able to interact, for example via complementarity (i.e., 
able to base-pair with), with a portion of its substrate. Preferably, such complementarity is 

10 100%, but can be less if desired. For example, as few as 10 bases out of 14 can be base-paired 
(see for example Werner and Uhlenbeck, 1995, Nucleic Acids Research, 23, 2092-2096; 
Hammann et al, 1999, Antisense and Nucleic Acid Drug Dev., 9, 25-31). Examples of such 
arms are shown generally in Figures 1-3. That is, these arms contain sequences within a 
enzymatic nucleic acid which are intended to bring enzymatic nucleic acid and target RNA 

15 together through complementary base-pairing interactions. The enzymatic nucleic acid of the 
invention can have binding arms that are contiguous or non-contiguous and can be of varying 
lengths. The length of the binding arm(s) are preferably greater than or equal to four 
nucleotides and of sufficient length to stably interact with the target RNA; preferably 12-100 
nucleotides; more preferably 14-24 nucleotides long (see for example Werner and Uhlenbeck, 

10 supra; Hamman et al, supra; Hampel et al., EP0360257; Berzal-Herranz et al, 1993, EMBO 
J., 12, 2567-73). If two binding arms are chosen, the design is such that the length of the 
binding arms are symmetrical (i.e., each of the binding arms is of the same length; e.g., five 
and five nucleotides, or six and six nucleotides, or seven and seven nucleotides long) or 
asymmetrical (i.e., the binding arms are of different length; e.g., six and three nucleotides; 

15 three and six nucleotides long; four and five nucleotides long; four and six nucleotides long; 
four and seven nucleotides long; and the like). 

By "Inozyme" or "NCH" motif or configuration is meant, an enzymatic nucleic acid 
molecule comprising a motif as is generally described as NCH Rz in Figure 1 and in Ludwig 
et al, International PCT Publication No. WO 98/58058 and US Patent Application Serial No. 
30 08/878,640. Inozymes possess endonuclease activity to cleave nucleic acid substrates having 
a cleavage triplet NCH/, where N is a nucleotide, C is cytidine and H is adenosine, uridine or 
cytidine, and "/" represents the cleavage site. H is used interchangeably with X. Inozymes 
can also possess endonuclease activity to cleave nucleic acid substrates having a cleavage 
triplet NCN/, where N is a nucleotide, C is cytidine, and "/" represents the cleavage site. "I" 
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in Figure 1 represents an Inosine nucleotide, preferably a ribo-Inosine or xylo-Inosine 
nucleoside. 

By "G-cleaver" motif or configuration is meant, an enzymatic nucleic acid molecule 
comprising a motif as is generally described as G-cleaver Rz in Figure 1 and in Eckstein et 
5 al t US 6,127,173. G-cleavers possess endonuclease activity to cleave nucleic acid substrates 
having a cleavage triplet NYN/, where N is a nucleotide, Y is uridine or cytidine and "/" 
represents the cleavage site. G-cleavers can be chemically modified as is generally shown in 
Figure 1. 

By "amberzyme" motif or configuration is meant, an enzymatic nucleic acid molecule 
10 comprising a motif as is generally described in Figure 2 and in Beigelman et al t 
International PCT publication No. WO 99/55857 and US Patent Application Serial No. 
09/476,387. Amberzymes possess endonuclease activity to cleave nucleic acid substrates 
having a cleavage triplet NG/N, where N is a nucleotide, G is guanosine, and "/" represents 
the cleavage site. Amberzymes can be chemically modified to increase nuclease stability 
15 through substitutions as are generally shown in Figure 2. In addition, differing nucleoside 
and/or non-nucleoside linkers can be used to substitute the 5'-gaaa-3' loops shown in the 
figure. Amberzymes represent a non-limiting example of an enzymatic nucleic acid molecule 
that does not require a ribonucleotide (2' -OH) group within its own nucleic acid sequence for 
activity. 

10 By "zinzyme" motif or configuration is meant, an enzymatic nucleic acid molecule 

comprising a motif as is generally described in Figure 3 and in Beigelman et al, International 
PCT publication No. WO 99/55857 and US Patent Application Serial No. 09/918,728. 
Zinzymes possess endonuclease activity to cleave nucleic acid substrates having a cleavage 
triplet including but not limited to YG/Y, where Y is uridine or cytidine, and G is guanosine 

25 and "/" represents the cleavage site. Zinzymes can be chemically modified to increase 
nuclease stability through substitutions as are generally shown in Figure 3, including 
substituting 2 , -0-methyl guanosine nucleotides for guanosine nucleotides. In addition, 
differing nucleotide and/or non-nucleotide linkers can be used to substitute the 5'-gaaa-2' 
loop shown in the figure. Zinzymes represent a non-limiting example of an enzymatic nucleic 

iO acid molecule that does not require a ribonucleotide (2 5 -OH) group within its own nucleic 
acid sequence for activity. 

By 'DNAzyme' is meant, an enzymatic nucleic acid molecule that does not require the 
presence of a 2' -OH group within its own nucleic acid sequence for activity. In particular 
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embodiments the enzymatic nucleic acid molecule can have an attached linker or linkers or 
other attached or associated groups, moieties, or chains containing one or more nucleotides 
with 2'-OH groups. DNAzymes can be synthesized chemically or expressed endogenously in 
vivo, by means of a single stranded DNA vector or equivalent thereof. An example of a 
5 DNAzyme is shown in Figure 4 and is generally reviewed in Usman et al, US patent No., 
6,159,714; Chartrand et al, 1995, NAR 23, 4092; Breaker et al, 1995, Chem. Bio. 2, 655; 
Santoro et al, 1997, PNAS 94, 4262; Breaker, 1999, Nature Biotechnology, 17, 422-423; and 
Santoro et al, 2000, J. Am. Chem. Soc, 122, 2433-39. The "10-23" DNAzyme motif is one 
particular type of DNAzyme that was evolved using in vitro selection, see Santoro et al, 

10 supra and as generally described in Joyce et al, US 5,807,718. Additional DNAzyme motifs 
can be selected by using techniques similar to those described in these references, and hence, 
are within the scope of the present invention. DNAzymes of the invention can comprise 
nucleotides modified at the nucleic acid base, sugar, or phosphate backbone. Non-limiting 
examples of sugar modifications that can be used in DNAzymes of the invention include 2'- 

1 5 O-alkyl modifications such as 2'-0-methyl or 2'-0-allyl, 2'-C-alkyl modifications such as 2'- 
C-allyl, 2'-deoxy-2' -amino, 2 '-halo modifications such as 2'-ftuoro, 2'-chloro, or 2'-bromo, 
isomeric modifications such as arabinofuranose or xylofuranose based nucleic acids, and 
other sugar modifications such as 4'-thio or 4'-carbocyclic nucleic acids. Non-limiting 
examples of nucleic acid based modifications that can be used in DNAzymes of the invention 

20 include modified purine heterocycles, G-clamp heterocycles, and various modified pyrimidine 
cycles. Non-limiting examples of backbone modifications that can be used in DNAzymes of 
the invention include phosphorothioate, phosphorodithioate, phosphoramidate, and 
methylphosphonate internucleotide linkages. DNAzymes of the invention can comprise 
naturally occurring nucleic acids, chimeras of chemically modified and naturally occurring 

25 nucleic acids, or completely modified nucleic acids. 

In general, enzymatic nucleic acids act by first binding to a target RNA. Such binding 
occurs through the target binding portion of a enzymatic nucleic acid that is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. Thus, 
the enzymatic nucleic acid first recognizes and then binds a target RNA through 

30 complementary base-pairing, and once bound to the correct site, acts enzymatically to cut the 
target RNA. Strategic cleavage of such a target RNA will destroy its ability to direct 
synthesis of an encoded protein. After an enzymatic nucleic acid has bound and cleaved its 
RNA target, it is released from that RNA to search for another target and can repeatedly bind 
and cleave new targets. Thus, a single enzymatic nucleic acid molecule is able to cleave 

35 many molecules of target RNA. In addition, the enzymatic nucleic acid molecule is a highly 
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specific inhibitor of gene expression, with the specificity of inhibition depending not only on 
the base-pairing mechanism of binding to the target RNA, but also on the mechanism of 
target RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage can 
completely eliminate catalytic activity of an enzymatic nucleic acid molecule. 

5 By "sufficient length" is meant an oligonucleotide of greater than or equal to 3 

nucleotides that is of a length great enough to provide the intended function under the 
expected condition. For example, for binding arms of enzymatic nucleic acid "sufficient 
length" means that the binding arm sequence is long enough to provide stable binding to a 
target site under the expected binding conditions. Preferably, the binding arms are not so 
1 0 long as to prevent useful turnover of the nucleic acid molecule. 

By "stably interact" is meant interaction of oligonucleotides with target nucleic acid 
molecules (e.g., by forming hydrogen bonds with complementary nucleotides in the target 
under physiological conditions) that is sufficient to the intended purpose (e.g., cleavage of 
target RNA by an enzyme). 

15 By "equivalent" RNA to Ras is meant to include those naturally occurring RNA 

molecules having homology (partial or complete) to Ras nucleic acids or encoding for 
proteins with similar function as Ras proteins in various organisms, including humans, 
rodents, primates, rabbits, pigs, protozoans, fungi, plants, and other microorganisms and 
parasites. The equivalent RNA sequence can also include, in addition to the coding region, 

20 regions such as a 5 '-untranslated region, a 3' -untranslated region, introns, a intron-exon 
junction and the like. 

By "equivalent" RNA to HIV is meant to include those naturally occurring RNA 
molecules having homology (partial or complete) to HIV nucleic acids or encoding for 
proteins with similar function as HIV proteins in various organisms, including human, rodent, 
25 primate, rabbit, pig, protozoans, fungi, plants, and other microorganisms and parasites. The 
equivalent RNA sequence also includes in addition to the coding region, regions such as 5'- 
untranslated region, 3 '-untranslated region, introns, intron-exon junction and the like. 

By "equivalent" RNA to HER2 is meant to include those naturally occurring RNA 
molecules having homology (partial or complete) to HER2 nucleic acids or encoding for 
30 proteins with similar function as HER2 proteins in various organisms, including humans, 
rodents, primates, rabbits, pigs, protozoans, fungi, plants, and other microorganisms and 
parasites. The equivalent RNA sequence also includes, in addition to the coding region, 
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regions such as a 5 '-untranslated region, a 3 '-untranslated region, introns, a intron-exon 
junction and the like. 

By "homology" is meant the nucleotide sequence of two or more nucleic acid molecules 
is partially or completely identical. 

5 By "component" of HIV is meant a peptide or protein expressed from an HIV gene, for 

example nef f vif t tat, or rev viral gene products. 

By "component" of HER2 is meant a peptide or protein subunit expressed from a HER2 

gene. 

By "component" of Ras is meant a peptide or protein subunit expressed from a Ras 

1 0 gene. 

By "gene" it is meant a nucleic acid that encodes an RNA, for example, nucleic acid 
sequences including but not limited to structural genes encoding a polypeptide. 

"Complementarity" refers to the ability of a nucleic acid to form hydrogen bond or 
bonds with another RNA sequence by either traditional Watson-Crick or other non-traditional 

1 5 types. In reference to the nucleic molecules of the present invention, the binding free energy 
for a nucleic acid molecule with its target or complementary sequence is sufficient to allow 
the relevant function of the nucleic acid to proceed, e.g., enzymatic nucleic acid cleavage, 
antisense or triple helix inhibition. Determination of binding free energies for nucleic acid 
molecules is well known in the art (see, e.g., Turner et al, 1987, CSH Symp. Quant. Biol. LIT 

10 pp.123-133; Frier et al, 1986, Proc. Nat. Acad. Set USA 83:9373-9377; Turner et al, 1987, 
J. Am. Chem. Soc. 109:3783-3785). A percent complementarity indicates the percentage of 
contiguous residues in a nucleic acid molecule that can form hydrogen bonds {e.g., Watson- 
Crick base pairing) with a second nucleic acid sequence {e.g., 5, 6, 7, 8, 9, 10 out of 10 being 
50%, 60%, 70%, 80%, 90%, and 100% complementary). "Perfectly complementary" means 

15 that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same 
number of contiguous residues in a second nucleic acid sequence. 

By "RNA" is meant a molecule comprising at least one ribonucleotide residue. By 
"ribonucleotide" or "2'-OH" is meant a nucleotide with a hydroxyl group at the T position of 
a p-D-ribo-furanose moiety. 

30 By "decoy " is meant a nucleic acid molecule, for example RNA or DNA, or aptamer 

that is designed to preferentially bind to a predetermined ligand. Such binding can result in 
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the inhibition or activation of a target molecule. A decoy or aptamer can compete with a 
naturally occurring binding target for the binding of a specific ligand. For example, it has 
been shown that over-expression of HTV trans-activation response (TAR) RNA can act as a 
"decoy" and efficiently binds HTV tat protein, thereby preventing it from binding to TAR 
5 sequences encoded in the HIV RNA (Sullenger et al, 1990, Cell, 63, 601-608). This is but a 
specific example and those in the art will recognize that other embodiments can be readily 
generated using techniques generally known in the art, see for example Gold et al, 1995, 
Annu. Rev. Biochem., 64, 763; Brody and Gold, 2000, J. Biotechnol, 74, 5; Sun, 2000, Curr 
Opin. Mol Ther, 2, 100; Kusser, 2000, J. Biotechnol, 74, 27; Hermann and Patel, 2000, 
10 Science, 287, 820; and Jayasena, 1999, Clinical Chemistry, 45, 1628. Similarly, a decoy can 
be designed to bind to Ras and block the binding of Ras or a decoy can be designed to bind to 
Ras and prevent interaction with the Ras protein. 

By "aptamer" or "nucleic acid aptamer" as used herein is meant a nucleic acid molecule 
that binds specifically to a target molecule wherein the nucleic acid molecule has sequence 

15 that is distinct from sequence recognized by the target molecule in its natural setting. 
Alternately, an aptamer can be a nucleic acid molecule that binds to a target molecule where 
the target molecule does not naturally bind to a nucleic acid. The target molecule can be any 
molecule of interest. For example, the aptamer can be used to bind to a ligand binding domain 
of a protein, thereby preventing interaction of the naturally occurring ligand with the protein. 

20 Similarly, the nucleic acid molecules of the instant invention can bind to RAS, Her-2 or HIV 
encoded RNA or proteins receptors to block activity of the activity of target protein or nucleic 
acid. This is a non-limiting example and those in the art will recognize that other 
embodiments can be readily generated using techniques generally known in the art, see for 
example Gold et al, US 5,475,096 and 5,270,163; Gold et al, 1995, Annu. Rev. Biochem., 

25 64, 763; Brody and Gold, 2000, J. Biotechnol, 74, 5; Sim, 2000, Curr. Opin. Mol Titer., 2, 
100; Kusser, 2000, J. Biotechnol, 74, 27; Hermann and Patel, 2000, Science, 287, 820; and 
Jayasena, 1999, Clinical Chemistry, 45, 1628. 

The term "short interfering RNA" or "siRNA" as used herein refers to a double 
stranded nucleic acid molecule capable of RNA interference "RNAi", see for example Bass, 

30 2001, Nature, 411, 428-429; Elbashir et al., 2001, Nature, 41 1, 494-498; and Kreutzer et al, 
International PCT Publication No. WO 00/44895; Zernicka-Goetz et al, International PCT 
Publication No. WO 01/36646; Fire, International PCT Publication No. WO 99/32619; 
Plaetinck et al, International PCT Publication No. WO 00/01846; Mello and Fire, 
International PCT Publication No. WO 01/29058; Deschamps-Depaillette, International PCT 

35 Publication No. WO 99/07409; and Li et al, International PCT Publication No. WO 
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( 

00/44914. As used herein, siRNA molecules need not be limited to those molecules 
containing only RNA, but further encompasses chemically modified nucleotides and non- 
nucleotides. 

Nucleic acid molecules that modulate expression of Ras-specific RNAs represent a 
5 therapeutic approach to treat cancer, including, but not limited to colorectal cancer, bladder 
cancer, lung cancer, pancreatic cancer, breast cancer, or prostate cancer and any other cancer, 
disease or condition that responds to the modulation of Ras expression. 

Nucleic acid molecules that modulate expression of HTV-specific RNAs also represent 
a therapeutic approach to treat acquired immunodeficiency syndrome (AIDS) and/or any other 
1 0 disease, condition, or syndrome which respond to the modulation of HIV expression. 

Nucleic acid molecules that modulate expression of HER2-specific RNAs represent a 
therapeutic approach to treat cancer, including, but not limited to breast and ovarian cancer 
and any other cancer, disease or condition that responds to the modulation of HER2 
expression. 

15 In one embodiment of the inventions described herein, the enzymatic nucleic acid 

molecule is formed in a hammerhead or hairpin motif, but can also be formed in the motif of 
a hepatitis delta virus, group I intron, group II intron or RNase P RNA (in association with an 
RNA guide sequence), Neurospora VS RNA, DNAzymes, NCH cleaving motifs, or G- 
cleavers. Examples of such hammerhead motifs are described by Dreyfus, supra, Rossi et aL, 

20 1992, AIDS Research and Human Retroviruses 8, 183; of hairpin motifs by Hampel et aL, 
EP0360257, Hampel and Tritz, 1989 Biochemistry 28, 4929, Feldstein et aL, 1989, Gene 82, 
53, Haseloff and Gerlach, 1989, Gene, 82, 43, and Hampel et aL, 1990 Nucleic Acids Res. 18, 
299; Chowrira & McSwiggen, US. Patent No. 5,631,359; of the hepatitis delta virus motif is 
described by Perrotta and Been, 1992 Biochemistry 31, 16; of the RNase P motif by Guerrier- 

25 Takada et aL, 1983 Cell 35, 849; Forster and Altaian, 1990, Science 249, 783; Li and Altaian, 
1996, Nucleic Acids Res. 24, 835; Neurospora VS RNA ribozyme motif is described by 
Collins (Saville and Collins, 1990 Cell 61, 685-696; Saville and Collins, 1991 Proc. NatL 
Acad. Set USA 88, 8826-8830; Collins and Olive, 1993 Biochemistiy 32, 2795-2799; Guo 
and Collins, 1995, EMBO. J. 14, 363); Group H introns are described by Griffin et aL, 1995, 

30 Chem. BioL 2, 761; Michels and Pyle, 1995, Biochemistry 34, 2965; Pyle et aL, International 
PCT Publication No. WO 96/22689; of the Group I intron by Cech et aL, U.S. Patent 
4,987,071 and of DNAzymes by Usman et aL, International PCT Publication No. WO 
95/11304; Chartrand et aL, 1995, NAR 23, 4092; Breaker et aL, 1995, Chem. Bio. 2, 655; 
Santoro et aL, 1997, PNAS 94, 4262, and Beigelman et aL, International PCT publication No. 
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WO 99/55857. NCH cleaving motifs are described in Ludwig & Sproat, International PCT 
Publication No. WO 98/58058; and G-cleavers are described in Kore et aL, 1998, Nucleic 
Acids Research 26, 4116-4120 and Eckstein et aL, International PCT Publication No. WO 
99/16871. Additional motifs such as the Aptazyme (Breaker et al, WO 98/43993), 
5 Amberzyme (Class I motif; Figure 2; Beigelman et al, U.S. Serial No. 09/301,511) and 
Zinzyme (Figure 3) (Beigelman et al, U.S. Serial No. 09/301,511), all included by reference 
herein including drawings, can also be used in the present invention. These specific motifs or 
configurations are not limiting in the invention and those skilled in the art will recognize that 
all that is important in an enzymatic nucleic acid molecule of this invention is that it has a 
1 0 specific substrate binding site which is complementary to one or more of the target gene RNA 
regions, and that it have nucleotide sequences within or surrounding that substrate binding 
site which impart an RNA cleaving activity to the molecule (Cech et al., U.S. Patent No. 
4,987,071). 

In one embodiment of the present invention, a nucleic acid molecule of the instant 

15 invention can be between about 10 and 100 nucleotides in length. Exemplary enzymatic 
nucleic acid molecules of the invention are shown in the Tables herein. For example, 
enzymatic nucleic acid molecules of the invention are preferably between about 15 and 50 
nucleotides in length, more preferably between about 25 and 40 nucleotides in length, e.g., 
34, 36, or 38 nucleotides in length (for example see Jarvis et al, 1996, J. Biol. Chem., 271, 

10 29107-291 12). Exemplary DNAzymes of the invention are preferably between about 15 and 
40 nucleotides in length, more preferably between about 25 and 35 nucleotides in length, e.g., 
29, 30, 31, or 32 nucleotides in length (see for example Santoro et aL, 1998, Biochemistry, 
37, 13330-13342; Chartrand et aL, 1995, Nucleic Acids Research, 23, 4092-4096). 
Exemplary antisense molecules of the invention are preferably between about 15 and 75 

25 nucleotides in length, more preferably between about 20 and 35 nucleotides in length, e.g., 
25, 26, 27, or 28 nucleotides in length (see for example Woolf et al., 1992, PNAS., 89, 7305- 
7309; Milner et al., 1997, Nature Biotechnology, 15, 537-541). Exemplary triplex forming 
oligonucleotide molecules of the invention are preferably between about 10 and 40 
nucleotides in length, more preferably between about 12 and 25 nucleotides in length, e.g., 

30 18, 19, 20, or 21 nucleotides in length (see for example Maher et aL, 1990, Biochemistry, 29, 
8820-8826; Strobel and Dervan, 1990, Science, 249, 73-75). Those skilled in the art will 
recognize that all that is required is for a nucleic acid molecule to be of length and 
conformation sufficient and suitable for the nucleic acid molecule to interact with its target 
and/or catalyze a reaction contemplated herein. The length of nucleic acid molecules of the 

35 instant invention are not limiting within the general limits stated. 
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Preferably, a nucleic acid molecule that modulates, for example, down-regulates Ras, 
HIV, and/or HER2 expression and/or activity, comprises between 12 and 100 bases 
complementary to a RNA molecule of Ras, HIV, and/or HER2 respectively. Even more 
preferably, a nucleic acid molecule that modulates Ras, HIV, and/or HER2 expression 
5 comprises between 14 and 24 bases complementary to a RNA molecule of Ras, HIV, and/or 
HER2 respectively. 

The invention provides a method for producing a class of nucleic acid-based gene 
modulating agents that exhibit a high degree of specificity for RNA of a desired target. For 
example, an enzymatic nucleic acid molecule is preferably targeted to a highly conserved 

1 0 sequence region of target RNAs encoding Ras (and specifically a Ras gene) such that specific 
treatment of a disease or condition can be provided with either one or several nucleic acid 
molecules of the invention. Such nucleic acid molecules can be delivered exogenously to 
specific tissue or cellular targets as required. Alternatively, the nucleic acid molecules (e.g., 
enzymatic nucleic acid molecules, siRNA, antisense, and/or DNAzymes) can be expressed 

1 5 from DNA and/or RNA vectors that are delivered to specific cells. 

As used herein "cell" is used in its usual biological sense, and does not refer to an entire 
multicellular organism. A cell can, for example, be in vitro, e.g., in cell culture, or present in 
a multicellular organism, including, e.g., birds, plants and mammals such as humans, cows, 
sheep, apes, monkeys, swine, dogs, and cats. The cell can be prokaryotic (e.g., bacterial cell) 
20 or eukaryotic (e.g., mammalian or plant cell). 

By "Ras proteins" is meant, a peptide or protein comprising Ras tyrosine kinase-type 
cell surface receptor or a peptide or protein encoded by a Ras gene, such as K-Ras, H-Ras, or 
N-Ras. 

By "HIV proteins" is meant, a peptide or protein comprising a component of HIV or a 
25 peptide or protein encoded by a HIV gene. 

By "HER2 proteins" is meant, a peptide or protein comprising HER2/ERB2/NEU 
tyrosine kinase-type cell surface receptor or a peptide or protein encoded by a 
HER2/ERB2/NEU gene. 

By "highly conserved sequence region" is meant, a nucleotide sequence of one or more 
30 regions in a target gene that does not vary significantly from one generation to the other or 
from one biological system to the other. 
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Nucleic acid-based modulators, including inhibitors, of Ras expression are useful for 
the prevention and/or treatment of cancer, including but not limited to breast cancer and 
ovarian cancer and any other disease or condition that respond to the modulation of Ras 
expression. 

5 Nucleic acid-based inhibitors of HTV expression are useful for the prevention and/or 

treatment of acquired immunodeficiency disease (AIDS) and related diseases and conditions, 
including but not limited to Kaposi's sarcoma, lymphoma, cervical cancer, squamous cell 
carcinoma, cardiac myopathy, rheumatic diseases, and opportunistic infection, for example 
Pneumocystis carinii, Cytomegalovirus, Herpes simplex, Mycobacteria, Cryptococcus, 
10 Toxoplasma, Progressive multifocal leucoencepalopathy (Papovavirus), Mycobacteria, 
Aspergillus, Cryptococcus, Candida, Cryptosporidium, Isospora belli, Microsporidia and any 
other disease or condition which respond to the modulation of HIV expression. 

Nucleic acid-based inhibitors of HER2 expression are useful for the prevention and/or 
treatment of cancer, including but not limited to breast cancer and ovarian cancer and any 
1 5 other disease or condition that respond to the modulation of HER2 expression. 

By "related" is meant that the reduction of RAS, HIV, or HER2 expression (specifically 
RAS, HTV, or HER2 genes respectively) RNA levels and thus reduction in the level of the 
respective protein relieves, to some extent, the symptoms of the disease or condition. 

The nucleic acid-based molecules of the invention can be added directly, or can be 
20 complexed with cationic lipids, packaged within liposomes, or otherwise delivered to target 
cells or tissues. The nucleic acid or nucleic acid complexes can be locally administered to 
relevant tissues ex vivo, or in vivo through injection or infusion pump, with or without their 
incorporation in biopolymers. In certain embodiments, the enzymatic nucleic acid molecules 
comprise sequences that are complementary to the substrate sequences in the Tables herein. 
25 Examples of such enzymatic nucleic acid molecules also are shown in the Tables herein. 
Examples of such enzymatic nucleic acid molecules consist essentially of sequences defined 
in these tables. 

In another embodiment, the invention features siRNA, antisense nucleic acid molecules 
and 2-5A chimeras comprising sequences complementary to the substrate sequences shown in 
30 the Tables herein. Such nucleic acid molecules can comprise sequences as shown for the 
binding arms of the enzymatic nucleic acid molecules in the Tables. Similarly, triplex 
molecules can be targeted to corresponding DNA target regions; such molecules can comprise 
the DNA equivalent of a target sequence or a sequence complementary to the specified target 
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(substrate) sequence. Typically, antisense molecules are complementary to a target sequence 
along a single contiguous sequence of the antisense molecule. However, in certain 
embodiments, an antisense molecule can bind to a substrate such that the substrate molecule 
forms a loop, and/or an antisense molecule can bind such that the antisense molecule forms a 
5 loop. Thus, the antisense molecule can be complementary to two or more non-contiguous 
substrate sequences. In addition, two or more non-contiguous sequence portions of an 
antisense molecule can be complementary to a target sequence. 

By "consists essentially of is meant that the active nucleic acid molecule of the 
invention, for example, an enzymatic nucleic acid molecule, contains an enzymatic center or 

10 core equivalent to those in the examples, and binding arms able to bind RNA such that 
cleavage at the target site occurs. Other sequences can be present that do not interfere with 
such cleavage. Thus, a core region of an enzymatic nucleic acid molecule can, for example, 
include one or more loop, stem-loop structure, or linker that does not prevent enzymatic 
activity. Thus, various regions in the sequences in the Tables can be such a loop, stem-loop, 

1 5 nucleotide linker, and/or non-nucleotide linker and can be represented generally as sequence 
"X". The nucleic acid molecules of the instant invention, such as Hammerhead, Inozyme, G- 
cleaver, amberzyme, zinzyme, DNAzyme, antisense, 2-5A antisense, triplex forming nucleic 
acid, and decoy nucleic acids, can contain other sequences or non-nucleotide linkers that do 
not interfere with the function of the nucleic acid molecule. 

20 Sequence X can be a linker of > 2 nucleotides in length, preferably 3, 4, 5, 6, 7, 8, 9, 10, 

15, 20, 26, 30, where the nucleotides can preferably be internally base-paired to form a stem 
of preferably > 2 base pairs. Alternatively or in addition, sequence X can be a non-nucleotide 
linker. In yet another embodiment, the nucleotide linker X can be a nucleic acid aptamer, such 
as an ATP aptamer, Ras Rev aptamer (RRE), Ras Tat aptamer (TAR) and others (for a review 

25 see Gold et al, 1995, Annu, Rev. Biochem., 64, 763; and Szostak & Ellington, 1993, in The 
RNA World, ed. Gesteland and Atkins, pp. 511, CSH Laboratory Press). A "nucleic acid 
aptamer" as used herein is meant to indicate a nucleic acid sequence capable of interacting 
with a ligand. The ligand can be any natural or a synthetic molecule, including but not limited 
to a resin, metabolites, nucleosides, nucleotides, drugs, toxins, transition state analogs, 

30 peptides, lipids, proteins, amino acids, nucleic acid molecules, hormones, carbohydrates, 
receptors, cells, viruses, bacteria and others. 

In yet another embodiment, a non-nucleotide linker X is as defined herein. Non- 
nucleotides as can include abasic nucleotide, polyether, polyamine, polyamide, peptide, 
carbohydrate, lipid, or polyhydrocarbon compounds. Specific examples include those 
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described by Seela and Kaiser, Nucleic Acids Res. 1990, 75:6353 and Nucleic Acids Res. 
1987, 75:3113; Cload and Schepartz, 1 Am. Chem. Soc. 1991, 775:6324; Richardson and 
Schepartz, J. Am. Chem. Soc. 1991, 773:5109; Ma et al, Nucleic Acids Res. 1993, 27:2585 
and Biochemistiy 1993, 52:1751; Durand et al, Nucleic Acids Res. 1990, 75:6353; McCurdy 
5 et al, Nucleosides & Nucleotides 1991, 70:287; Jschke et al, Tetrahedron Lett, 1993, 
34:301; Ono et al., Biochemistry 1991, 30:9914; Arnold et al, International Publication No. 
WO 89/02439; Usman et al, International Publication No. WO 95/06731; Dudycz et al, 
International Publication No. WO 95/11910 and Ferentz and Verdine, J, Am. Chem. Soc, 
1991, 773:4000, all hereby incorporated by reference herein. A "non-nucleotide" further 

1 0 means any group or compound that can be incorporated into a nucleic acid chain in the place 
of one or more nucleotide units, including either sugar and/or phosphate substitutions, and 
allows the remaining bases to exhibit their enzymatic activity. The group or compound can 
be abasic in that it does not contain a commonly recognized nucleotide base, such as 
adenosine, guanine, cytosine, uracil or thymine. Thus, in a preferred embodiment, the 

1 5 invention features an enzymatic nucleic acid molecule having one or more non-nucleotide 
moieties, and having enzymatic activity to cleave an RNA or DNA molecule. 

In another aspect of the invention, enzymatic nucleic acid molecules, siRNA molecules 
or antisense molecules that interact with target RNA molecules and modulate gene expression 
activity are expressed from transcription units inserted into DNA or RNA vectors. The 

20 recombinant vectors are preferably DNA plasmids or viral vectors. Enzymatic nucleic acid 
molecule or antisense expressing viral vectors can be constructed based on, but not limited to, 
adeno-associated virus, retrovirus, adenovirus, or alphavirus as well as others known in the 
art. Preferably, recombinant vectors capable of expressing enzymatic nucleic acid molecules 
or antisense are delivered as described below, and persist in target cells. Alternatively, viral 

25 vectors can be used that provide for transient expression of enzymatic nucleic acid molecules 
or antisense. Such vectors can be repeatedly administered as necessary. Once expressed, the 
enzymatic nucleic acid molecules or antisense bind to target RNA and modulate its function 
or expression. Delivery of enzymatic nucleic acid molecule or antisense expressing vectors 
can be systemic, such as by intravenous or intramuscular administration, by administration to 

30 target cells ex -planted from the patient followed by reintroduction into the patient, or by any 
other means that allows for introduction into a desired target cell. Antisense DNA and 
DNAzymes can be expressed via the use of a single stranded DNA intracellular expression 
vector. 

By "vectors" is meant any nucleic acid- and/or viral-based technique used to deliver a 
35 desired nucleic acid. 
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By "subject" or "patient" is meant an organism that is a donor or recipient of explanted 
cells or the cells of the organism. "Subject" or "patient" also refers to an organism to which 
the nucleic acid molecules of the invention can be administered. Preferably, a subject or 
patient is a mammal or mammalian cells. More preferably, a subject or patient is a human or 
5 human cells. 

By "enhanced enzymatic activity" is meant to include activity measured in cells and/or 
in vivo where the activity is a reflection of both the catalytic activity and the stability of the 
nucleic acid molecules of the invention. In this invention, the product of these properties can 
be increased in vivo compared to an all RNA enzymatic nucleic acid or all DNA enzyme, for 
1 0 example, with a nucleic acid molecule comprising chemical modifications. In some cases, the 
activity or stability of the nucleic acid molecule can be decreased (i.e., less than ten-fold), but 
the overall activity of the nucleic acid molecule is enhanced, in vivo. 

Nucleic acid molecules of the instant invention, individually, or in combination or in 
conjunction with other drugs, can be used to treat diseases or conditions discussed above. For 
1 5 example, to treat a disease or condition associated with the levels of Ras, HIV, or HER2, a 
subject can be treated, or other appropriate cells can be treated, as is evident to those skilled 
in the art, individually or in combination with one or more drugs under conditions suitable for 
the treatment. 

In a further embodiment, the described molecules, such as antisense, siRNA, or 
20 enzymatic nucleic acid molecules, can be used in combination with other known treatments to 
treat conditions or diseases discussed above. For example, the described molecules can be 
used in combination with one or more known therapeutic agents to treat cancer, for example 
colorectal cancer, bladder cancer, lung cancer, pancreatic cancer, breast cancer, or prostate 
cancer, and any other disease or condition that respond to the modulation of Ras expression. 

25 In another embodiment, the invention features nucleic acid-based inhibitors (e.g., 

enzymatic nucleic acid molecules, (including DNAzymes), siRNA and methods for their use 
to down regulate or inhibit the expression of genes (e.g., Ras genes) capable of progression 
and/or maintenance of cancer and/or other disease states that respond to the modulation of 
Ras expression. 

30 In a further embodiment, the described molecules, such as antisense, siRNA, or 

enzymatic nucleic acids, can be used in combination with other known treatments to treat 
conditions or diseases discussed above. For example, the described molecules can be used in 
combination with one or more known therapeutic agents to treat acquired immunodeficiency 
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disease (AIDS) and related diseases and conditions, including but not limited to Kaposi's 
sarcoma, lymphoma, cervical cancer, squamous cell carcinoma, cardiac myopathy, rheumatic 
diseases, and opportunistic infection, for example Pneumocystis carinii, Cytomegalovirus, 
Herpes simplex, Mycobacteria, Cryptococcus, Toxoplasma, Progressive multifocal 
5 leucoencepalopathy (Papovavirus), Mycobacteria, Aspergillus, Cryptococcus, Candida, 
Cryptosporidium, Isospora belli, Microsporidia and any other disease or condition which 
respond to the modulation of HIV expression. 

Nucleic acid molecules of the instant invention, individually, or in combination or in 
conjunction with other drugs, can be used to treat diseases or conditions discussed above. For 
10 example, to treat a disease or condition associated with the levels of HER2, a patient can be 
treated, or other appropriate cells can be treated, as is evident to those skilled in the art, 
individually or in combination with one or more drugs under conditions suitable for the 
treatment. 

In a further embodiment, the described molecules, such as antisense, siRNA or 
1 5 enzymatic nucleic acid molecules, can be used in combination with other known treatments to 
treat conditions or diseases discussed above. For example, the described molecules can be 
used in combination with one or more known therapeutic agents to treat cancer, for example 
ovarian cancer and/or breast cancer, and any other disease or condition that respond to the 
modulation ofHER2 expression. 

20 In another embodiment, the invention features nucleic acid-based inhibitors (e.g., 

enzymatic nucleic acid molecules, (including ribozymes, antisense nucleic acids, 2-5A 
antisense chimeras, triplex DNA, antisense nucleic acids , containing RNA cleaving chemical 
groups), siRNA and methods for their use to down regulate or inhibit the expression of genes 
(e.g., HER2 genes) capable of progression and/or maintenance of cancer and/or other disease 

25 states that respond to the modulation of HER2 expression. 

By "comprising" is meant including, but not limited to, whatever follows the word 
"comprising". Thus, use of the term "comprising" indicates that the listed elements are 
required or mandatory, but that other elements are optional and may or may not be present. 
By "consisting of is meant including, and limited to, whatever follows the phrase "consisting 
30 of. 

Other features and advantages of the invention will be apparent from the following 
description of the preferred embodiments thereof, and from the claims. 
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Mechanism of action of Nucleic Acid Molecules of the Invention as is Know in the Art 

Antisense : Antisense molecules can be modified or unmodified RNA, DNA, or mixed 
polymer oligonucleotides and primarily function by specifically binding to matching 
sequences resulting in inhibition of peptide synthesis (Wu-Pong, Nov 1994, BioPharm, 20- 
5 33). The antisense oligonucleotide binds to target RNA by Watson Crick base-pairing and 
blocks gene expression by preventing ribosomal translation of the bound sequences either by 
steric blocking or by activating RNase H enzyme. Antisense molecules can also alter protein 
synthesis by interfering with RNA processing or transport from the nucleus into the 
cytoplasm (Mukhopadhyay & Roth, 1996, Crit. Rev. in Oncogenesis 7, 151-190). 

10 In addition, binding of single stranded DNA to RNA can result in nuclease degradation 

of the heteroduplex (Wu-Pong, supra; Crooke, supra). Backbone modified DNA chemistry 
which have been thus far been shown to act as substrates for RNase H are phosphorothioates, 
phosphorodithioates, and borontrifluoridates. In addition, 2'-arabino and 2'-fluoro arabino- 
containing oligos can also activate RNase H activity. 

15 A number of antisense molecules have been described that utilize novel configurations 

of chemically modified nucleotides, secondary structure, and/or RNase H substrate domains 
(Woolf et al, International PCT Publication No. WO 98/13526; Thompson et al t 
International PCT Publication No. WO 99/54459; Hartmann et al., USSN 60/101,174, filed 
on September 21, 1998). All of these references are incorporated by reference herein in their 

20 entirety. 

In addition, antisense deoxyoligoribonucleotides can be used to target RNA by means 
of DNA-RNA interactions, thereby activating RNase H, which digests the target RNA in the 
duplex. Antisense DNA can be expressed via the use of a single stranded DNA intracellular 
expression vector or equivalents and variations thereof. 

25 RNA interference : RNA interference refers to the process of sequence specific post 

transcriptional gene silencing in animals mediated by short interfering RNAs (siRNA) (Fire et 
al, 1998, Nature, 391, 806). The corresponding process in plants is commonly referred to as 
post transcriptional gene silencing or RNA silencing and is also referred to as quelling in 
fungi. The process of post transcriptional gene silencing is thought to be an evolutionarily 

30 conserved cellular defense mechanism used to prevent the expression of foreign genes which 
is commonly shared by diverse flora and phyla (Fire et al, 1999, Trends Genet, 15, 358). 
Such protection from foreign gene expression may have evolved in response to the production 
of double stranded RNAs (dsRNA) derived from viral infection or the random integration of 
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transposon elements into a host genome via a cellular response that specifically destroys 
homologous single stranded RNA or viral genomic RNA. The presence of dsRNA in cells 
triggers the RNAi response though a mechanism that has yet to be fully characterized. This 
mechanism appears to be different from the interferon response that results from dsRNA 
5 mediated activation of protein kinase PKR and 2',5'-oligoadenylate synthetase resulting in 
non-specific cleavage of mRNA by ribonuclease L. 

The presence of long dsRNAs in cells stimulates the activity of a ribonuclease DI 
enzyme referred to as dicer. Dicer is involved in the processing of the dsRNA into short 
pieces of dsRNA known as short interfering RNAs (siRNA) (Berstein et al, 2001, Nature, 

10 409, 363). Short interfering RNAs derived from dicer activity are typically about 21-23 
nucleotides in length and comprise about 19 base pair duplexes. Dicer has also been 
implicated in the excision of 21 and 22 nucleotide small temporal RNAs (stRNA) from 
precursor RNA of conserved structure that are implicated in translational control (Hutvagner 
et al, 2001, Science, 293, 834). The RNAi response also features an endonuclease complex 

1 5 containing a siRNA, commonly referred to as an RNA-induced silencing complex (RISC), 
which mediates cleavage of single stranded RNA having sequence homologous to the siRNA. 
Cleavage of the target RNA takes place in the middle of the region complementary to the 
guide sequence of the siRNA duplex (Elbashir et al, 2001, Genes Dev., 15, 188). 

Short interfering RNA mediated RNAi has been studied in a variety of systems. Fire et 

20 al, 1998, Nature, 391, 806, were the first to observe RNAi in C. Elegans. Wianny and 
Goetz, 1999, Nature Cell Biol, 2, 70, describes RNAi mediated by dsRNA in mouse 
embryos. Hammond et al, 2000, Nature, 404, 293, describe RNAi in Drosophila cells 
transfected with dsRNA. Elbashir et al, 2001 , Nature, 411, 494, describe RNAi induced by 
introduction of duplexes of synthetic 21 -nucleotide RNAs in cultured mammalian cells 

25 including human embryonic kidney and HeLa cells. Recent work in Drosophila embryonic 
lysates has revealed certain requirements for siRNA length, structure, chemical composition, 
and sequence that are essential to mediate efficient RNAi activity. These studies have shown 
that 21 nucleotide siRNA duplexes are most active when containing two nucleotide 3'- 
overhangs. Furthermore, substitution of one or both siRNA strands with 2'-deoxy or 2'-0- 

30 methyl nucleotides abolishes RNAi activity, whereas substitution of 3 '-terminal siRNA 
nucleotides with deoxy nucleotides was shown to be tolerated. Mismatch sequences in the 
center of the siRNA duplex were also shown to abolish RNAi activity. In addition, these 
studies also indicate that the position of the cleavage site in the target RNA is defined by the 
5'-end of the siRNA guide sequence rather than the 3'-end (Elbashir et al, 2001, EMBO J., 

35 20, 6877). Other studies have indicated that a 5'-phosphate on the target-complementary 
strand of a siRNA duplex is required for siRNA activity and that ATP is utilized to maintain 
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the 5'-phosphate moiety on the siRNA (Nykanen et al, 2001, Cell, 107, 309), however 
siRNA molecules lacking a 5 '-phosphate are active when introduced exogenously, suggesting 
that 5 '-phosphorylation of siRNA constructs may occur in vivo. 

Enzymatic Nucleic Acid : Several varieties of naturally-occurring enzymatic RNAs are 
5 presently known. In addition, several in vitro selection (evolution) strategies (Orgel, 1979, 
Proc. R. Soc. London, B 205, 435) have been used to evolve new nucleic acid catalysts 
capable of catalyzing cleavage and ligation of phosphodiester linkages (Joyce, 1989, Gene, 
82, 83-87; Beaudry et al, 1992, Science 257, 635-641; Joyce, 1992, Scientific American 267, 
90-97; Breaker et al, 1994, TIBTECH 12, 268; Bartel et al, 1993, Science 261:1411-1418; 

1 0 Szostak, 1993, TIBS 17, 89-93; Kumar et al, 1995, FASEB J., 9, 1 183; Breaker, 1996, Curr. 
Op. Biotech, 7, 442; Santoro et al, 1997, Proc. Natl Acad. Set, 94, 4262; Tang et al, 1991, 
RNA 3, 914; Nakamaye & Eckstein, 1994, supra] Long & Uhlenbeck, 1994, supra; Ishizaka 
et al, 1995, supra; Vaish et al, 1991, Biochemistry 36, 6495; all of these are incorporated by 
reference herein). Each can catalyze a series of reactions including the hydrolysis of 

15 phosphodiester bonds in trans (and thus can cleave other RNA molecules) under 
physiological conditions. 

Nucleic acid molecules of this invention can modulate, e.g., down-regulate, Ras protein 
expression and can be used to treat disease or diagnose disease associated with the levels of 
Ras, HIV and/or HER2. Enzymatic nucleic acid sequences targeting Ras, HIV and/or HER2 
20 RNA and sequences that can be targeted with nucleic acid molecules of the invention to 
down-regulate Ras expression are shown in the Tables herein. 

The enzymatic nature of an enzymatic nucleic acid molecule allows the concentration 
of enzymatic nucleic acid molecule necessary to affect a therapeutic treatment to be lower 
than a nucleic acid molecule lacking enzymatic activity. This reflects the ability of the 

25 enzymatic nucleic acid molecule to act enzymatically. Thus, a single enzymatic nucleic acid 
molecule is able to cleave many molecules of target RNA. In addition, the enzymatic nucleic 
acid molecule is a highly specific inhibitor, with the specificity of inhibition depending not 
only on the base-pairing mechanism of binding to the target RNA, but also on the mechanism 
of target RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage 

30 can be chosen to completely eliminate catalytic activity of a enzymatic nucleic acid molecule. 

Nucleic acid molecules having an endonuclease enzymatic activity are able to 
repeatedly cleave other separate RNA molecules in a nucleotide base sequence-specific 
manner. With proper design and construction, such enzymatic nucleic acid molecules can be 
targeted to virtually any RNA transcript, and achieve efficient cleavage in vitro (Zaug et al, 
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324, Nature 429 1986; Uhlenbeck, 1987 Nature 328, 596; Kim et al, 84 Proc. Natl Acad. 
Set USA 8788, 1987; Dreyfus, 1988, Einstein Quart J. Bio. Med., 6, 92; Haseloff and 
Gerlach, 334 Nature 585, 1988; Cech, 260 JAMA 3030, 1988; and Jefferies et al, 17 Nucleic 
Acids Research 1371, 1989; Santoro etal, 1997 supra). 

5 Because of their sequence specificity, taws-cleaving enzymatic nucleic acid molecules 

can be used as therapeutic agents for human disease (Usman & McSwiggen, 1995 Ann. Rep. 
Med. Chem. 30, 285-294; Christoffersen and Marr, 1995 J. Med. Chem. 38, 2023-2037). 
Enzymatic nucleic acid molecules can be designed to cleave specific RNA targets within the 
background of cellular RNA. Such a cleavage event renders the RNA non-functional and 
1 0 abrogates protein expression from that RNA. In this manner, synthesis of a protein associated 
with a disease state can be selectively inhibited (Warashina et al, 1999, Chemistry and 
Biology, 6, 237-250). 

Enzymatic nucleic acid molecules of the invention that are allosterically regulated 
("allozymes") can be used to modulate, including down-regulate, Ras, HIV and/or HER2 

1 5 expression. These allosteric enzymatic nucleic acids or allozymes (see for example George et 
al, US Patent Nos. 5,834,186 and 5,741,679, Shih et al, US Patent No. 5,589,332, Nathan et 
aL, US Patent No 5,871,914, Nathan and Ellington, International PCT publication No. WO 
00/24931, Breaker et aL, International PCT Publication Nos. WO 00/26226 and 98/27104, 
and Sullenger et al, International PCT publication No. WO 99/29842) are designed to 

20 respond to a signaling agent, for example, mutant Ras, HIV and/or HER2 protein, wild-type 
Ras, HIV and/or HER2 protein, mutant Ras, HIV and/or HER2 RNA, wild-type Ras, HIV 
and/or HER2 RNA, other proteins and/or RNAs involved in Ras, HIV and/or HER2 activity, 
compounds, metals, polymers, molecules and/or drugs that are targeted to Ras, HIV and/or 
HER2 expressing cells etc., which, in turn, modulate the activity of the enzymatic nucleic 

25 acid molecule. In response to interaction with a predetermined signaling agent, the activity of 
the allosteric enzymatic nucleic acid molecule is activated or inhibited such that the 
expression of a particular target is selectively regulated, including down-regulated. The target 
can comprise wild-type Ras, HIV and/or HER2, mutant Ras, HIV and/or HER2, a component 
of Ras, HIV and/or HER2, and/or a predetermined cellular component that modulates Ras, 

30 HIV and/or HER2 activity. For example, allosteric enzymatic nucleic acid molecules that are 
activated by interaction with a RNA encoding Ras, HIV and/or HER2 protein can be used as 
therapeutic agents in vivo. The presence of RNA encoding the Ras, HIV and/or HER2 
protein activates the allosteric enzymatic nucleic acid molecule that subsequently cleaves the 
RNA encoding Ras, HIV and/or HER2 protein, resulting in the inhibition of Ras, HIV and/or 
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HER2 protein expression. In this manner, cells that express the Ras, HIV and/or HER2 
protein are selectively targeted. 

In another non-limiting example, an allozyme can be activated by a Ras, HIV and/or 
HER2 protein, peptide, or mutant polypeptide that causes the allozyme to inhibit the 
5 expression of Ras, HTV and/or HER2 gene, by, for example, cleaving RNA encoded by Ras, 
HTV and/or HER2 gene. In this non-limiting example, the allozyme acts as a decoy to inhibit 
the function of Ras, HTV and/or HER2 and also inhibit the expression of Ras, HTV and/or 
HER2 once activated by the Ras, HIV and/or HER2 protein. 

Target sites 

1 0 Targets for useful enzymatic nucleic acid molecules and antisense nucleic acids can be 

determined as disclosed in Draper et al, WO 93/23569; Sullivan et al, WO 93/23057; 
Thompson et al, WO 94/02595; Draper et al, WO 95/04818; McSwiggen et al, US Patent 
No. 5,525,468, and hereby incorporated by reference herein in totality. Other examples 
include the following PCT applications, which concern inactivation of expression of disease- 

1 5 related genes: WO 95/23225, WO 95/13380, WO 94/02595, incorporated by reference herein. 
Rather than repeat the guidance provided in those documents here, below are provided 
specific non-limiting examples of such methods. Enzymatic nucleic acid molecules to such 
targets are designed as described in the above applications and synthesized to be tested in 
vitro and in vivo, as also described. The sequences of human K-Ras, H-Ras, HIV-1 and HER2 

20 RNAs were screened for optimal enzymatic nucleic acid target sites using a computer-folding 
algorithm. Nucleic acid molecule binding/cleavage sites were identified. These sites are 
shown in the Tables (all sequences are 5' to 3' in the tables). The nucleotide base position is 
noted in the Tables as that site to be cleaved by the designated type of enzymatic nucleic acid 
molecule. Human sequences can be screened and enzymatic nucleic acid molecule and/or 

25 antisense thereafter designed, as discussed in Stinchcomb et al f WO 95/23225. In addition, 
mouse targeted nucleic acid molecules can be used to test efficacy of action of the enzymatic 
nucleic acid molecule, siRNA and/or antisense prior to testing in humans. 

In addition, enzymatic nucleic acid, siRNA, and antisense nucleic acid molecule 
binding/cleavage sites were identified. The nucleic acid molecules are individually analyzed 
30 by computer folding (Jaeger et al, 1989 Proc. Natl Acad. Set USA, 86, 7706) to assess 
whether the sequences fold into the appropriate secondary structure. Those nucleic acid 
molecules with unfavorable intramolecular interactions, such as between, for example the 
binding arms and the catalytic core of an enzymatic nucleic acid, are eliminated from 
consideration. Varying binding arm lengths can be chosen to optimize activity. 



WO 02/097114 



PCT/US02/16840 



Antisense, hammerhead, DNAzyme, NCH, amberzyme, zinzyme or G-Cleaver 
enzymatic nucleic acid molecule, siRNA, and antisense nucleic acid binding/cleavage sites 
were identified and were designed to anneal to various sites in the RNA target. The 
enzymatic nucleic acid binding arms or siRNA and antisense nucleic acid sequences are 
5 complementary to the target site sequences described above. The nucleic acid molecules are 
chemically synthesized. The method of synthesis used follows the procedure for normal 
DNA/RNA synthesis as described below and in Usman et al, 1987 J. Am. Chem. Soc, 109, 
7845; Scaringe et al, 1990 Nucleic Acids Res., 18, 5433; and Wincott et al, 1995 Nucleic 
Acids Res. 23, 2677-2684; Caruthers et al., 1992, Methods in Enzymology 21 1,3-19. 

10 Synthesis of Nucleic acid Molecules 

Synthesis of nucleic acids greater than 100 nucleotides in length can be difficult using 
automated methods, and the therapeutic cost of such molecules can be prohibitive. In this 
invention, small nucleic acid motifs ("small" refers to nucleic acid motifs less than about 100 
nucleotides in length, preferably less than about 80 nucleotides in length, and more preferably 
1 5 less than about 50 nucleotides in length; e.g., DNAzymes) are preferably used for exogenous 
delivery. The simple structure of these molecules increases the ability of the nucleic acid to 
invade targeted regions of RNA structure. Exemplary molecules of the instant invention are 
chemically synthesized as described herein, and others can similarly be synthesized. 

Oligonucleotides (e.g., DNAzymes, antisense) are synthesized using protocols known 
20 in the art as described in Caruthers et al., 1992, Methods in Enzymology 211, 3-19, Thompson 
et al, International PCT Publication No. WO 99/54459, Wincott et al, 1995, Nucleic Acids 
Res. 23, 2677-2684, Wincott et al, 1991, Methods Mol. Bio., 74, 59, Brennan et al, 1998, 
Biotechnol Bioeng., 61, 33-45, and Brennan, US patent No. 6,001,311. All of these 
references are incorporated herein by reference. The synthesis of oligonucleotides makes use 
25 of common nucleic acid protecting and coupling groups, such as dimethoxytrityl at the S'-end, 
and phosphoramidites at the 3 ! -end. In a non-limiting example, small scale syntheses are 
conducted on a 394 Applied Biosystems, Inc. synthesizer using a 0.2 jimol scale protocol 
with a 2.5 min coupling step for 2'-0~methylated nucleotides and a 45 sec coupling step for 
2'-deoxy nucleotides. Table I outlines the amounts and the contact times of the reagents 
30 used in the synthesis cycle. Alternatively, syntheses at the 0.2 ^imol scale can be performed 
on a 96-well plate synthesizer, such as the instrument produced by Protogene (Palo Alto, CA) 
with minimal modification to the cycle. A 33-fold excess (60 |iL of 0.1 1 M = 6.6 ^mol) of 
2'-0-methyl phosphoramidite and a 105-fold excess of S-ethyl tetrazole (60 (iL of 0.25 M = 
15 nmol) can be used in each coupling cycle of 2'-0-methyl residues relative to polymer- 
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bound 5'-hydroxyl. A 22-fold excess (40 (iL of 0.11 M - 4 .4 \imol) of deoxy 
phosphoramidite and a 70-fold excess of S-ethyl tetrazole (40 \iL of 0.25 M = 10 nmol) can 
be used in each coupling cycle of deoxy residues relative to polymer-bound S'-hydroxyl. 
Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by 
5 colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other oligonucleotide 
synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer include; detritylation 
solution is 3% TCA in methylene chloride (ABI); capping is performed with 16% Af-methyl 
imidazole in THF (ABI) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI); and 
oxidation solution is 16.9 mM I 2 , 49 mM pyridine, 9% water in THF (PERSEPTIVE™). 
1 0 Burdick & Jackson Synthesis Grade acetonitrile is used directly from the reagent bottle. S- 
Ethyltetrazole solution (0.25 M in acetonitrile) is made up from the solid obtained from 
American International Chemical, Inc. Alternately, for the introduction of phosphorothioate 
linkages, Beaucage reagent (3H-l,2-Benzodithiol-3-one 1,1-dioxide, 0.05 M in acetonitrile) is 
used. 

15 Deprotection of the DNAzymes is performed as follows: the polymer-bound trityl-on 

oligoribonucleotide is transferred to a 4 mL glass screw top vial and suspended in a solution 
of 40% aq. methylamine (1 mL) at 65 °C for 10 min. After cooling to -20 °C, the supernatant 
is removed from the polymer support. The support is washed three times with 1.0 mL of 
EtOH:MeCN:H20/3:l:l, vortex ed and the supernatant is then added to the first supernatant. 

20 The combined supernatants, containing the oligoribonucleotide, are dried to a white powder. 

The method of synthesis used for RNA and chemically modified RNA or DNA, 
including certain enzymatic nucleic acid molecules and siRNA molecules, follows the 
procedure as described in Usman et al, 1987, J. Am. Chem. Soc, 109, 7845; Scaringe et al, 
1990, Nucleic Acids Res., 18, 5433; and Wincott et al, 1995, Nucleic Acids Res. 23, 2677- 

25 2684 Wincott et al., 1997, Methods Mol Bio., 74, 59, and makes use of common nucleic acid 
protecting and coupling groups, such as dimethoxytrityl at the 5 -end, and phosphoramidites 
at the 3'-end. In a non-limiting example, small scale syntheses are conducted on a 394 
Applied Biosystems, Inc. synthesizer using a 0.2 (amol scale protocol with a 7.5 min coupling 
step for alkylsilyl protected nucleotides and a 2.5 min coupling step for 2'-0-methylated 

30 nucleotides. Table I outlines the amounts and the contact times of the reagents used in the 
synthesis cycle. Alternatively, syntheses at the 0.2 (irnol scale can be done on a 96-well plate 
synthesizer, such as the instrument produced by Protogene (Palo Alto, CA) with minimal 
modification to the cycle. A 33-fold excess (60 nL of 0.11 M = 6.6 jamol) of 2'-0-methyl 
phosphoramidite and a 75-fold excess of S-ethyl tetrazole (60 ^L of 0.25 M = 15 fimol) can 

35 be used in each coupling cycle of 2'-0-methyl residues relative to polymer-bound 5'- 
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hydroxyl. A 66-fold excess (120 \xL of 0.11 M = 13.2 ^imol) of alkylsilyl (ribo) protected 
phosphoramidite and a 150-fold excess of S-ethyl tetrazole (120 nL of 0.25 M = 30 fimol) 
can be used in each coupling cycle of ribo residues relative to polymer-bound 5'-hydroxyL 
Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by 
5 colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other oligonucleotide 
synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer include; detritylation 
solution is 3% TCA in methylene chloride (ART); capping is performed with 16% TV-methyl 
imidazole in THF (ABT) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI); oxidation 
solution is 16.9 mM I 2 , 49 raM pyridine, 9% water in THF (PERSEPTIVE™). Burdick & 

1 0 Jackson Synthesis Grade acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole 
solution (0.25 M in acetonitrile) is made up from the solid obtained from American 
International Chemical, Inc. Alternately, for the introduction of phosphorothioate linkages, 
Beaucage reagent (3H-l,2-Benzodithiol-3-one 1,1-dioxide 0.05 M in acetonitrile) is used. 

Deprotection of the RNA is performed using either a two-pot or one-pot protocol. For 
1 5 the two-pot protocol, the polymer-bound trityl-on oligoribonucleotide is transferred to a 4 mL 
glass screw top vial and suspended in a solution of 40% aq. methylamine (1 mL) at 65 °C for 
10 min. After cooling to -20 °C, the supernatant is removed from the polymer support. The 
support is washed three times with 1.0 mL of EtOH:MeCN:H20/3:l:l, vortex ed and the 
supernatant is then added to the first supernatant. The combined supernatants, containing the 
20 oligoribonucleotide, are dried to a white powder. The base deprotected oligoribonucleotide is 
resuspended in anhydrous TEA/HF/NMP solution (300 \iL of a solution of 1.5 mL N- 
methylpyrrolidinone, 750 \xL TEA and 1 mL TEA-3HF to provide a 1.4 M HF concentration) 
and heated to 65 °C. After 1.5 h, the oligomer is quenched with 1.5 M NH4HCO3. 

Alternatively, for the one-pot protocol, the polymer-bound trityl-on oligoribonucleotide 
25 is transferred to a 4 mL glass screw top vial and suspended in a solution of 33% ethanolic 
methylamine/DMSO: 1/1 (0.8 mL) at 65 °C for 15 min. The vial is brought to r.t. TEA-3HF 
(0.1 mL) is added and the vial is heated at 65 °C for 15 min. The sample is cooled at -20 °C 
and then quenched with 1 .5 M NH4HCO3. 

For purification of the trityl-on oligomers, the quenched NH4HCO3 solution is loaded 
30 onto a C-18 containing cartridge that had been prewashed with acetonitrile followed by 50 
mM TEAA. After washing the loaded cartridge with water, the RNA is detritylated with 
0.5% TFA for 13 min. The cartridge is then washed again with water, salt exchanged with 1 
M NaCl and washed with water again. The oligonucleotide is then eluted with 30% 
acetonitrile. 
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Inactive nucleic acid molecules or binding attenuated control (BAC) oligonucleotides 
can be synthesized by substituting one or more nucleotides in the nucleic acid molecule to 
inactivate the molecule and such molecules can serve as a negative control. 

The average stepwise coupling yields are typically >98% (Wincott et al, 1995 Nucleic 
5 Acids Res. 23, 2677-2684). Those of ordinary skill in the art will recognize that the scale of 
synthesis can be adapted to be larger or smaller than the example described above including 
but not limited to 96 well format, all that is important is the ratio of chemicals used in the 
reaction. 

Alternatively, the nucleic acid molecules of the present invention can be synthesized 
1 0 separately and joined together post-synthetically, for example by ligation (Moore et al, 1992, 
Science 256, 9923; Draper et al, International PCT publication No. WO 93/23569; 
Shabarova et al, 1991, Nucleic Acids Research 19, 4247; Bellon et al, 1997 Nucleosides & 
Nucleotides, 16, 951; Bellon et al, 1997, Bioconjugate Chem. 8, 204). 

The nucleic acid molecules of the present invention can be modified extensively to 
1 5 enhance stability by modification with nuclease resistant groups, for example, 2'-amino, T-C- 
allyl, 2'-flouro, 2'-0-methyl, 2'-H (for a review see Usman and Cedergren, 1992, TIBS 17, 34; 
Usman et al, 199 >4, Nucleic Acids Symp. Ser. 31, 163). Enzymatic nucleic acid molecules are 
purified by gel electrophoresis using known methods or are purified by high pressure liquid 
chromatography (HPLC; See Wincott et al, Supra, the totality of which is hereby 
20 incorporated herein by reference) and are re-suspended in water. 

The sequences of the nucleic acid molecules, including enzymatic nucleic acid 
molecules and antisense, that are chemically synthesized, are shown in the Tables herein. 
These sequences are representative only of many more such sequences where the enzymatic 
portion of the enzymatic nucleic acid molecule (all but the binding arms) is modified to affect 
25 activity. For example, the enzymatic nucleic acid sequences listed in the Tables can be 
formed of deoxyribonucleotides or other nucleotides or non-nucleotides. Such enzymatic 
nucleic acid molecules with enzymatic activity are equivalent to the enzymatic nucleic acid 
molecules described specifically in the Tables. 

Optimizing Activity of the Nucleic Acid Molecule of the Invention. 

30 Chemically synthesizing nucleic acid molecules with modifications (base, sugar and/or 

phosphate) that prevent their degradation by serum ribonucleases can increase their potency 
(see e.g., Eckstein et al, International Publication No. WO 92/07065; Perrault et al, 1990 
Nature 344, 565; Pieken et al, 1991, Science 253, 314; Usman and Cedergren, 1992, Trends 
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in Biochem. Sci. 17, 334; Usman et aL, International Publication No. WO 93/15187; and 
Rossi et aL, International Publication No. WO 91/03162; Sproat, US Patent No. 5,334,711; 
and Burgin et aL, supra, all of which are hereby incorporated by reference in their entirety). 
All of the above references describe various chemical modifications that can be made to the 
5 base, phosphate and/or sugar moieties of the nucleic acid molecules described herein. 
Modifications which enhance their efficacy in cells, and removal of bases from nucleic acid 
molecules to shorten oligonucleotide synthesis times and reduce chemical requirements are 
desired. 

There are several examples of sugar, base and phosphate modifications that can be 

10 introduced into nucleic acid molecules with significant enhancement in their nuclease 
stability and efficacy. For example, oligonucleotides can be modified to enhance stability 
and/or enhance biological activity by modification with nuclease resistant groups, for 
example, 2-amino, 2 , -C-allyl, 2-flouro, 2-0-methyl, 2'-H, nucleotide base modifications (for 
a review see Usman and Cedergren, 1992, TIBS. 17, 34; Usman et aL, 1994, Nucleic Acids 

15 Symp. Ser. 31, 163; Burgin et aL, 1996, Biochemistry , 35, 14090). Sugar modification of 
nucleic acid molecules are also known to increase efficacy (see Eckstein et aL, International 
Publication PCT No. WO 92/07065; Perrault et aL Nature, 1990, 344, 565-568; Pieken et 
aL Science, 1991, 253, 314-317; Usman and Cedergren, Trends in Biochem. Set , 1992, 17, 
334-339; Usman et aL International Publication PCT No. WO 93/15187; Sproat, US Patent 

20 No. 5,334,711 and Beigelman et aL, 1995, J. Biol. Chem., 270, 25702; Beigelman et aL, 
International PCT publication No. WO 97/26270; Beigelman et aL, US Patent No. 5,716,824; 
Usman et aL, US patent No. 5,627,053; Woolf et aL, International PCT Publication No. WO 
98/13526; Thompson et aL, USSN 60/082,404 which was filed on April 20, 1998; Karpeisky 
et aL, 1998, Tetrahedron Lett., 39, 1131; Earnshaw and Gait, 1998, Biopolymers (Nucleic 

25 acid Sciences), 48, 39-55; Verma and Eckstein, 1998, Annu. Rev. Biochem., 67, 99-134; and 
Burlina et aL, 1997, Bioorg. Med. Chem., 5, 1999-2010; all of the references are hereby 
incorporated in their totality by reference herein). The publications describe general methods 
and strategies to determine the location of incorporation of sugar, base and/or phosphate 
modifications and the like into enzymatic nucleic acid molecules without inhibiting catalysis. 

30 Similar modifications can be used as described herein to modify the nucleic acid molecules of 
the instant invention. 

While chemical modification of oligonucleotide internucleotide linkages with 
phosphorothioate, phosphorothioate, and/or 5'-methylphosphonate linkages improves 
stability, excessive modifications can cause some toxicity. Therefore, when designing nucleic 
35 acid molecules, the amount of these internucleotide linkages should be minimized. The 
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reduction in the concentration of these linkages can lower toxicity, resulting in increased 
efficacy and higher specificity of the therapeutic nucleic acid molecules. 

Nucleic acid molecules having chemical modifications that maintain or enhance activity 
are provided. Such nucleic acid molecules are also generally more resistant to nucleases than 
5 unmodified nucleic acid molecules. Thus, the in vitro and/or in vivo activity should not be 
significantly lowered. Therapeutic nucleic acid molecules delivered exogenously are 
optimally stable within cells until translation of the target RNA has been inhibited long 
enough to reduce the levels of the undesirable protein. This period of time varies between 
hours to days, depending upon the disease state. Nucleic acid molecules are preferably 
10 resistant to nucleases in order to function as effective intracellular therapeutic agents. 
Improvements in the chemical synthesis of RNA and DNA (Wincott et ah, 1995 Nucleic 
Acids Res. 23, 2677; Caruthers et al, 1992, Methods in Enzymology 211,3-19 (incorporated 
by reference herein)) have expanded the ability to modify nucleic acid molecules by 
introducing nucleotide modifications to enhance their nuclease stability as described above. 

15 In one embodiment, nucleic acid molecules of the invention include one or more G- 

clamp nucleotides. A G-clamp nucleotide is a modified cytosine analog wherein 
modifications result in the ability to hydrogen bond both Watson-Crick and Hoogsteen faces 
of a complementary guanine within a duplex, see for example Lin and Matteucci, 1998, J. 
Am, Chem. Soc, 120, 8531-8532. A single G-clamp analog substation within an 

20 oligonucleotide can result in substantially enhanced helical thermal stability and mismatch 
discrimination when hybridized to complementary oligonucleotides. The inclusion of such 
nucleotides in nucleic acid molecules of the invention can enable both enhanced affinity and 
specificity to nucleic acid targets. 

In another embodiment, the invention features conjugates and/or complexes of nucleic 
25 acid molecules targeting Ras genes such as K-Ras, H-Ras, and/or N-Ras. Compositions and 
conjugates are used to facilitate delivery of molecules into a biological system, such as cells. 
The conjugates provided by the instant invention can impart therapeutic activity by 
transferring therapeutic compounds across cellular membranes, altering the pharmacokinetics, 
and/or modulating the localization of nucleic acid molecules of the invention. The present 
30 invention encompasses the design and synthesis of novel agents for the delivery of molecules, 
including but not limited to, small molecules, lipids, phospholipids, nucleosides, nucleotides, 
nucleic acids, antibodies, toxins, negatively charged polymers and other polymers, for 
example proteins, peptides, hormones, carbohydrates, polyethylene glycols, or polyamines, 
across cellular membranes. In general, the transporters described are designed to be used 
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either individually or as part of a multi-component system, with or without degradable 
linkers. These compounds are expected to improve delivery and/or localization of nucleic 
acid molecules of the invention into a number of cell types originating from different tissues, 
in the presence or absence of serum (see Sullenger and Cech, US 5,854,038). Conjugates of 
5 the molecules described herein can be attached to biologically active molecules via linkers 
that are biodegradable, such as biodegradable nucleic acid linker molecules. 

The term "biodegradable nucleic acid linker molecule" as used herein, refers to a 
nucleic acid molecule that is designed as a biodegradable linker to connect one molecule to 
another molecule, for example, a biologically active molecule. The stability of the 

10 biodegradable nucleic acid linker molecule can be modulated by using various combinations 
of ribonucleotides, deoxyribonucleotides, and chemically modified nucleotides, for example 
2'-0-methyl, 2'-fluoro, 2'-amino, 2'-0-amino, 2 , -C-allyl, 2'-0-allyl, and other 2'-modified 
or base modified nucleotides. The biodegradable nucleic acid linker molecule can be a dimer, 
trimer, tetramer or longer nucleic acid molecule, for example, an oligonucleotide of about 2, 

15 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides in length, or can 
comprise a single nucleotide with a phosphorus based linkage, for example, a 
phosphoramidate or phosphodiester linkage. The biodegradable nucleic acid linker molecule 
can also comprise nucleic acid backbone, nucleic acid sugar, or nucleic acid base 
modifications. 

20 The term "biodegradable' 5 as used herein, refers to degradation in a biological system, 

for example, enzymatic degradation or chemical degradation. 

The term "biologically active molecule" as used herein, refers to compounds or 
molecules that are capable of eliciting or modifying a biological response in a system. Non- 
limiting examples of biologically active molecules contemplated by the instant invention 

25 include therapeutically active molecules such as antibodies, hormones, antivirals, peptides, 
proteins, chemotherapeutics, small molecules, vitamins, co-factors, nucleosides, nucleotides, 
oligonucleotides, enzymatic nucleic acids, antisense nucleic acids, triplex forming 
oligonucleotides, 2,5-A chimeras, siRNA, dsRNA, allozymes, aptamers, decoys and analogs 
thereof. Biologically active molecules of the invention also include molecules capable of 

30 modulating the pharmacokinetics and/or pharmacodynamics of other biologically active 
molecules, for example lipids and polymers such as polyamines, polyamides, polyethylene 
glycol and other polyethers. 

The term "phospholipid" as used herein, refers to a hydrophobic molecule comprising 
at least one phosphorus group. For example, a phospholipid can comprise a phosphorus 
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containing group and saturated or unsaturated alkyl group, optionally substituted with OH, 
COOH, oxo, amine, or substituted or unsubstituted aryl groups. 

Use of the nucleic acid-based molecules of the invention can lead to better treatment of 
the disease progression by affording the possibility of combination therapies (e.g., multiple 
5 antisense or enzymatic nucleic acid molecules targeted to different genes, nucleic acid 
molecules coupled with known small molecule inhibitors, or intermittent treatment with 
combinations of molecules (including different motifs) and/or other chemical or biological 
molecules). The treatment of subjects with nucleic acid molecules can also include 
combinations of different types of nucleic acid molecules. 

10 In the case that down-regulation of the target is desired, therapeutic nucleic acid 

molecules (e.g., DNAzymes) delivered exogenously are optimally stable within cells until 
translation of the target RNA has been inhibited long enough to reduce the levels of the 
targeted protein. This period of time varies between hours to days depending upon the 
disease state. These nucleic acid molecules should be resistant to nucleases in order to 

1 5 function as effective intracellular therapeutic agents. Improvements in the chemical synthesis 
of nucleic acid molecules described in the instant invention and others known in the art have 
expanded the ability to modify nucleic acid molecules by introducing nucleotide 
modifications to enhance their nuclease stability as described above. 

In another embodiment, nucleic acid catalysts having chemical modifications that 
20 maintain or enhance enzymatic activity are provided. Such nucleic acids are also generally 
more resistant to nucleases than unmodified nucleic acid. Thus, the in vitro and/or in vivo 
the activity of the nucleic acid should not be significantly lowered. As exemplified herein, 
such enzymatic nucleic acids are useful for in vitro and/or in vivo techniques even if activity 
over all is reduced 10 fold (Burgin et al, 1996, Biochemistry, 35, 14090). Such enzymatic 
25 nucleic acids herein are said to "maintain" the enzymatic activity of an all RNA ribozyme or 
all DNA DNAzyme. 

In another aspect the nucleic acid molecules comprise a 5' and/or a 3'- cap structure. 

By "cap structure" is meant chemical modifications, which have been incorporated at 
either terminus of the oligonucleotide (see, for example, Wincott et al, WO 97/26270, 
30 incorporated by reference herein). These terminal modifications protect the nucleic acid 
molecule from exonuclease degradation, and can help in delivery and/or localization within a 
cell. The cap can be present at the 5 '-terminus (5 '-cap) or at the 3 '-terminus (3 '-cap) or can 
be present on both termini. In non-limiting examples, the 5 '-cap includes inverted abasic 
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residue (moiety), 4',5 -methylene nucleotide; l-(beta-D-erythrofuranosyl) nucleotide, 4 , -thio 
nucleotide, carbocyclic nucleotide; 1,5-anhydrohexitol nucleotide; L-nucleotides; alpha- 
nucleotides; modified base nucleotide; phosphorodithioate linkage; /Ara>-pentofuranosyl 
nucleotide; acyclic 3 ! ,4'-seco nucleotide; acyclic 3,4-dihydroxybutyl nucleotide; acyclic 3,5- 
5 dihydroxypentyl nucleotide, 3 ! -3'-inverted nucleotide moiety; 3 f -3 , -inverted abasic moiety; 3- 
2'-inverted nucleotide moiety; 3-2-inverted abasic moiety; 1,4-butanediol phosphate; 3- 
phosphoramidate; hexylphosphate; aminohexyl phosphate; 3'-phosphate; 3 ? ~phosphorothioate; 
phosphorodithioate; or bridging or non-bridging methylphosphonate moiety (for more details 
see Wincott et ai, International PCT publication No. WO 97/26270, incorporated by 
1 0 reference herein). 

In another embodiment, the 3'-cap includes, for example ^S-methylene nucleotide; 1- 
(beta-D-erythrofiiranosyl) nucleotide; 4-thio nucleotide, carbocyclic nucleotide; S'-amino- 
alkyl phosphate; l,3-diamino-2-propyl phosphate, 3-aminopropyl phosphate; 6-aminohexyl 
phosphate; 1,2-aminododecyl phosphate; hydroxypropyl phosphate; 1,5-anhydrohexitol 

15 nucleotide; L-nucleotide; alpha-nucleotide; modified base nucleotide; phosphorodithioate; 
tfzreo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide; 
3,5-dihydroxypentyl nucleotide, 5'-5 -inverted nucleotide moiety; 5 -5 -inverted abasic moiety; 
5'-phosphoramidate; 5'-phosphorothioate; 1,4-butanediol phosphate; 5-amino; bridging 
and/or non-bridging 5-phosphoramidate, phosphorothioate and/or phosphorodithioate, 

20 bridging or non bridging methylphosphonate and S'-mercapto moieties (for more details see 
Beaucage and Iyer, 1993, Tetrahedron 49, 1925; incorporated by reference herein). 

By the term "non-nucleotide" is meant any group or compound winch can be 
incorporated into a nucleic acid chain in the place of one or more nucleotide units, including 
either sugar and/or phosphate substitutions, and allows the remaining bases to exhibit their 
25 enzymatic activity. The group or compound is abasic in that it does not contain a commonly 
recognized nucleotide base, such as adenosine, guanine, cytosine, uracil or thymine. 

The term "alkyl" as used herein refers to a saturated aliphatic hydrocarbon, including 
straight-chain, branched-chain "isoalkyl", and cyclic alkyl groups. The term "alkyl" also 
comprises alkoxy, alkyl-thio, alkyl-thio-alkyl, alkoxyalkyl, alkylamino, alkenyl, alkynyl, 
30 alkoxy, cycloalkenyl, cycloalkyl, cycloalkylalkyl, heterocycloalkyl, heteroaryl, C1-C6 
hydrocarbyl, aryl or substituted aryl groups. Preferably, the alkyl group has 1 to 12 carbons. 
More preferably it is a lower alkyl of from about 1 to 7 carbons, more preferably about 1 to 4 
carbons. The alkyl group can be substituted or unsubstituted. When substituted the 
substituted group(s) preferably comprise hydroxy, oxy, thio, amino, nitro, cyano, alkoxy, 
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alkyl-thio, alkyl-thio-alkyl, alkoxyalkyl, alkylamino, silyl, alkenyl, alkynyl, alkoxy, 
cycloalkenyl, cycloalkyl, cycloalkylalkyl, heterocycloalkyl, heteroaryl, C1-C6 hydrocarbyl, 
aryl or substituted aryl groups. The term "alkyl" also includes alkenyl groups containing at 
least one carbon-carbon double bond, including straight-chain, branched-chain, and cyclic 
5 groups. Preferably, the alkenyl group has about 2 to 12 carbons. More preferably it is a lower 
alkenyl of from about 2 to 7 carbons, more preferably about 2 to 4 carbons. The alkenyl 
group can be substituted or unsubstituted. When substituted the substituted group(s) 
preferably comprise hydroxy, oxy, thio, amino, nitro, cyano, alkoxy, alkyl-thio, alkyl-thio- 
alkyl, alkoxyalkyl, alkylamino, silyl, alkenyl, alkynyl, alkoxy, cycloalkenyl, cycloalkyl, 
10 cycloalkylalkyl, heterocycloalkyl, heteroaryl, C1-C6 hydrocarbyl, aryl or substituted aryl 
groups. 

The term "alkyl" also includes alkynyl groups containing at least one carbon-carbon 
triple bond, including straight-chain, branched-chain, and cyclic groups. Preferably, the 
alkynyl group has about 2 to 12 carbons. More preferably it is a lower alkynyl of from about 

15 2 to 7 carbons, more preferably about 2 to 4 carbons. The alkynyl group can be substituted or 
unsubstituted. When substituted the substituted group(s) preferably comprise hydroxy, oxy, 
thio, amino, nitro, cyano, alkoxy, alkyl-thio, alkyl-thio-alkyl, alkoxyalkyl, alkylamino, silyl, 
alkenyl, alkynyl, alkoxy, cycloalkenyl, cycloalkyl, cycloalkylalkyl, heterocycloalkyl, 
heteroaryl, C1-C6 hydrocarbyl, aryl or substituted aryl groups. Alkyl groups or moieties of 

20 the invention can also include aryl, alkylaryl, carbocyclic aryl, heterocyclic aryl, amide and 
ester groups. The preferred substituent(s) of aryl groups are halogen, trihalomethyl, hydroxyl, 
SH, OH, cyano, alkoxy, alkyl, alkenyl, alkynyl, and amino groups. An "alkylaryl". group 
refers to an alkyl group (as described above) covalently joined to an aryl group (as described 
above). Carbocyclic aryl groups are groups wherein the ring atoms on the aromatic ring are 

25 all carbon atoms. The carbon atoms are optionally substituted. Heterocyclic aryl groups are 
groups having from about 1 to 3 heteroatoms as ring atoms in the aromatic ring and the 
remainder of the ring atoms are carbon atoms. Suitable heteroatoms include oxygen, sulfur, 
and nitrogen, and include furanyl, thienyl, pyridyl, pyrrolyl, N-lower alkyl pyrrolo, pyrimidyl, 
pyrazinyl, imidazolyl and the like, all optionally substituted. An "amide" refers to an -C(O)- 

30 NH-R, where R is either alkyl, aryl, alkylaryl or hydrogen. An "ester" refers to an -C(0)-OR f , 
where R is either alkyl, aryl, alkylaryl or hydrogen. 

The term "alkoxyalkyl" as used herein refers to an alkyl-O-alkyl ether, for example, 
methoxyethyl or ethoxymethyl. 
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The term "alkyl-thio-alkyl" as used herein refers to an alkyl-S-alkyl thioether, for 
example, methylthiomethyl or methylthioethyl. 

The term "amino" as used herein refers to a nitrogen containing group as is known in 
the art derived from ammonia by the replacement of one or more hydrogen radicals by 
5 organic radicals. For example, the terms "aminoacyl" and "aminoalkyl" refer to specific N- 
substituted organic radicals with acyl and alkyl substituent groups respectively. 

The term "animation" as used herein refers to a process in which an amino group or 
substituted amine is introduced into an organic molecule. 

The term "exocyclic amine protecting moiety" as used herein refers to a nucleobase 
10 amino protecting group compatible with oligonucleotide synthesis, for example, an acyl or 
amide group. 

The term "alkenyl" as used herein refers to a straight or branched hydrocarbon of a 
designed number of carbon atoms containing at least one carbon-carbon double bond. 
Examples of "alkenyl" include vinyl, allyl, and 2-methyl-3-heptene. 

1 5 The term "alkoxy" as used herein refers to an alkyl group of indicated number of 

carbon atoms attached to the parent molecular moiety through an oxygen bridge. Examples 
of alkoxy groups include, for example, methoxy, ethoxy, propoxy and isopropoxy. 

The term "alkynyl" as used herein refers to a straight or branched hydrocarbon of a 
designed number of carbon atoms containing at least one carbon-carbon triple bond. 
20 Examples of "alkynyl" include propargyl, propyne, and 3-hexyne. 

The term "aryl" as used herein refers to an aromatic hydrocarbon ring system 
containing at least one aromatic ring. The aromatic ring can optionally be fused or otherwise 
attached to other aromatic hydrocarbon rings or non-aromatic hydrocarbon rings. Examples 
of aryl groups include, for example, phenyl, naphthyl, 1,2,3,4-tetrahydronaphthalene and 
25 biphenyl. Preferred examples of aryl groups include phenyl and naphthyl. 

The term "cycloalkenyr as used herein refers to a C3-C8 cyclic hydrocarbon 
containing at least one carbon-carbon double bond. Examples of cycloalkenyl include 
cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadiene, cyclohexenyl, 1,3- 
cyclohexadiene, cycloheptenyl, cycloheptatrienyl, and cyclooctenyl. 
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The term "cycloalkyl" as used herein refers to a C3-C8 cyclic hydrocarbon. Examples 
of cycloalkyl include cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl and 
cyclooctyl. 

The term "cycloalkylalkyl," as used herein, refers to a C3-C7 cycloalkyl group attached 
5 to the parent molecular moiety through an alkyl group, as defined above. Examples of 
cycloalkylalkyl groups include cyclopropylmethyl and cyclopentylethyl. 

The terms "halogen" or "halo" as used herein refers to indicate fluorine, chlorine, 
bromine, and iodine. 

The term "heterocycloalkyl," as used herein refers to a non-aromatic ring system 
10 containing at least one heteroatom selected from nitrogen, oxygen, and sulfur. The 
heterocycloalkyl ring can be optionally fused to or otherwise attached to other 
heterocycloalkyl rings and/or non-aromatic hydrocarbon rings. Preferred heterocycloalkyl 
groups have from 3 to 7 members. Examples of heterocycloalkyl groups include, for 
example, piperazine, morpholine, piperidine, tetrahydrofuran, pyrrolidine, and pyrazole. 
15 Preferred heterocycloalkyl groups include piperidinyl, piperazinyl, morpholinyl, and 
pyrolidinyl. 

The term "heteroaryl" as used herein refers to an aromatic ring system containing at 
least one heteroatom selected from nitrogen, oxygen, and sulfur. The heteroaryl ring can be 
fused or otherwise attached to one or more heteroaryl rings, aromatic or non-aromatic 

20 hydrocarbon rings or heterocycloalkyl rings. Examples of heteroaryl groups include, for 
example, pyridine, furan, thiophene, 5,6,7,8-tetrahydroisoquinoline and pyrimidine. Preferred 
examples of heteroaryl groups include thienyl, benzothienyl, pyridyl, quinolyl, pyrazinyl, 
pyrimidyl, imidazolyl, benzimidazolyl, furanyl, benzofiiranyl, thiazolyl, benzothiazolyl, 
isoxazolyl, oxadiazolyl, isothiazolyl, benzisothiazolyl, triazolyl, tetrazolyl, pyrrolyl, indolyl, 

25 pyrazolyl, and benzopyrazolyl. 

The term "C1-C6 hydrocarbyl" as used herein refers to straight, branched, or cyclic 
allcyl groups having 1-6 carbon atoms, optionally containing one or more carbon-carbon 
double or triple bonds. Examples of hydrocarbyl groups include, for example, methyl, ethyl, 
propyl, isopropyl, n-butyl, sec-butyl, tert-butyl, pentyl, 2-pentyl, isopentyl, neopentyl, hexyl, 
30 2-hexyl, 3-hexyl, 3-methylpentyl, vinyl, 2-pentene, cyclopropylmethyl, cyclopropyl, 
cyclohexylmethyl, cyclohexyl and propargyl. When reference is made herein to C1-C6 
hydrocarbyl containing one or two double or triple bonds it is understood that at least two 
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carbons are present in the alkyl for one double or triple bond,, and at least four carbons for two 
double or triple bonds. 

By "nucleotide" is meant a heterocyclic nitrogenous base in N-glycosidic linkage with 
a phosphorylated sugar. Nucleotides are recognized in the art to include natural bases 
5 (standard), and modified bases well known in the art. Such bases are generally located at the 
1* position of a nucleotide sugar moiety. Nucleotides generally comprise a base, sugar and a 
phosphate group. The nucleotides can be unmodified or modified at the sugar, phosphate 
and/or base moiety, (also referred to interchangeably as nucleotide analogs, modified 
nucleotides, non-natural nucleotides, non-standard nucleotides and other; see for example, 

10 Usman and McSwiggen, supra; Eckstein et al, International PCT Publication No. WO 
92/07065; Usman et al, International PCT Publication No. WO 93/15187; Uhlman & 
Peyman, supra all are hereby incorporated by reference herein. There are several examples of 
modified nucleic acid bases known in the art as summarized by Limbach et al., 1994, Nucleic 
Acids Res. 22, 2183. Some of the non-limiting examples of chemically modified and other 

15 natural nucleic acid bases that can be introduced into nucleic acids include, for example, 
inosine, purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 4, 6-trimethoxy 
benzene, 3-methyl uracil, dihydrouridine, naphthyl, aminophenyl, 5-alkylcytidines {e.g., 
5-methylcytidine), 5-alkyluridines (e.g., ribothymidine), 5-halouridine (e.g., 5-bromouridine) 
or 6-azapyrimidines or 6-alkylpyrimidines (e.g. 6-methyluridine), propyne, quesosine, 2- 

20 thiouridine, 4-thiouridine, wybutosine, wybutoxosine, 4-acetylcytidine, 5- 
(carboxyhydroxymethyl)uridine, 5 , -carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluridine, beta-D-galactosylqueosine, 1 -methyladenosine, 1 - 
methylinosine, 2,2-dimethylguanosine, 3-methylcytidine, 2-methyladenosine, 2- 
methylguanosine, N6-methyladenosine, 7-methylguanosine, 5-methoxyaminomethyl-2- 

25 thiouridine, 5-methylaminomethyluridine, 5-methylcarbonylmethyluridine, 5- 
methyloxyuridine, 5-methyl-2-thiouridine, 2-methylthio-N6-isopentenyladenosine, beta-D- 
mannosylqueosine, uridine-5-oxyacetic acid, 2-thiocytidine, threonine derivatives and others 
(Burgin et al, 1996, Biochemistry, 35, 14090; Uhlman & Peyman, supra). By "modified 
bases" in this aspect is meant nucleotide bases other than adenine, guanine, cytosine and 

30 uracil at V position or their equivalents; such bases can be used at any position, for example, 
within the catalytic core of an enzymatic nucleic acid molecule and/or in the substrate-binding 
regions of the nucleic acid molecule. 

By "nucleoside" is meant a heterocyclic nitrogenous base in N-glycosidic linkage with 
a sugar. Nucleosides are recognized in the art to include natural bases (standard), and 
35 modified bases well known in the art. Such bases are generally located at the V position of a 
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nucleoside sugar moiety. Nucleosides generally comprise a base and sugar group. The 
nucleosides can be unmodified or modified at the sugar, and/or base moiety (also referred to 
interchangeably as nucleoside analogs, modified nucleosides, non-natural nucleosides, non- 
standard nucleosides and other; see for example, Usman and McSwiggen, supra; Eckstein et 
5 al, International PCT Publication No. WO 92/07065; Usman et al, International PCT 
Publication No. WO 93/15187; Uhlman & Peyman, supra all are hereby incorporated by 
reference herein). There are several examples of modified nucleic acid bases known in the art 
as summarized by Limbach et al, 1994, Nucleic Acids Res. 22, 2183. Some of the non- 
limiting examples of chemically modified and other natural nucleic acid bases that can be 

1 0 introduced into nucleic acids include, inosine, purine, pyridin-4-one, pyridin-2-one, phenyl, 
pseudouracil, 2, 4, 6-trimethoxy benzene, 3-methyl uracil, dihydrouridine, naphthyl, 
aminophenyl, 5-alkylcytidines (e.g., 5-methylcytidine), 5-alkyluridines {e.g., ribothymidine), 
5-halouridine {e.g., 5-bromouridine) or 6-azapyrimidines or 6-aUcylpyrimidines {e.g. 6- 
methyluridine), propyne, quesosine, 2-thiouridine, 4-thiouridine, wybutosine, wybutoxosine, 

1 5 4-acetylcytidine, 5-(carboxyhydroxymethyl)uridine, 5 , -carboxymethylaminomethyl-2- 
thiouridine, 5-carboxymethylaminomethyluridine, beta-D-galactosylqueosine, 1 - 
methyladenosine, 1-methylinosine, 2,2-dimethylguanosine, 3-methylcytidine, 2- 
methyladenosine, 2-methylguanosine, N6-methyIadenosine, 7-methylguanosine, 5- 
methoxyaminomethyl-2-thiouridine, 5-methylaminomethyluridine, 5- 

20 methylcarbonylmethyluridine, 5-methyloxyuridine, 5-methyl-2-thiouridine, 2-methylthio-N6- 
isopentenyladenosine, beta-D-mannosylqueosine, uridine-5-oxyacetic acid, 2-thiocytidine, 
threonine derivatives and others (Burgin et al, 1996, Biochemistry, 35, 14090; Uhlman & 
Peyman, supra). By "modified bases" in this aspect is meant nucleoside bases other than 
adenine, guanine, cytosine and uracil at l 1 position or their equivalents; such bases can be 

25 used at any position, for example, within the catalytic core of an enzymatic nucleic acid 
molecule and/or in the substrate-binding regions of the nucleic acid molecule. 

In one embodiment, the invention features modified enzymatic nucleic acid molecules 
with phosphate backbone modifications comprising one or more phosphorothioate, 
phosphorodithioate, methylphosphonate, morpholino, amidate carbamate, carboxymethyl, 

30 acetamidate, polyamide, sulfonate, sulfonamide, sulfamate, fonnacetal, thioformacetal, 
and/or alkylsilyl, substitutions. For a review of oligonucleotide backbone modifications see 
Hunziker and Leumann, 1995, Nucleic Acid Analogues: Synthesis and Properties, in Modern 
Synthetic Methods, VCH, 331-417, and Mesmaeker et al., 1994, Novel Backbone 
Replacements for Oligonucleotides, in Carbohydrate Modifications in Antisense Research, 

35 ACS, 24-39. These references are hereby incorporated by reference herein. 
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By "abasic" is meant sugar moieties lacking a base or having other chemical groups in 
place of a base at the l 1 position, for example a 3\3'-linked or 5',5Minked deoxyabasic 
ribose derivative (for more details see Wincott et al, International PCT publication No. WO 
97/26270). 

5 By "unmodified nucleoside" is meant one of the bases adenine, cytosine, guanine, 

thymine, uracil joined to the 1 ' carbon of p-D-ribo-furanose. 

By "modified nucleoside" is meant any nucleotide base which contains a modification 
in the chemical structure of an unmodified nucleotide base, sugar and/or phosphate. 

In connection with 2'-modified nucleotides as described for the present invention, by 
10 "amino" is meant 2'-NH2 or 2'-0- NH 2 , which can be modified or unmodified. Such 
modified groups are described, for example, in Eckstein et al, U.S. Patent 5,672,695 and 
Matulic-Adamic et al, WO 98/28317, respectively, which are both incorporated by reference 
in their entireties. 

Various modifications to nucleic acid (e.g., DNAzyme) structure can be made to 
1 5 enhance the utility of these molecules. For example, such modifications can enhance shelf- 
life, half-life in vitro, stability, and ease of introduction of such oligonucleotides to the target 
site, including e.g., enhancing penetration of cellular membranes and conferring the ability to 
recognize and bind to targeted cells. 

Use of these molecules can lead to better treatment of the disease progression by 
20 affording the possibility of combination therapies {e.g., multiple enzymatic nucleic acid 
molecules targeted to different genes, enzymatic nucleic acid molecules coupled with known 
small molecule inhibitors, or intermittent treatment with combinations of enzymatic nucleic 
acid molecules (including different enzymatic nucleic acid molecule motifs) and/or other 
chemical or biological molecules). The treatment of subjects with nucleic acid molecules can 
25 also include combinations of different types of nucleic acid molecules. Therapies can be 
devised which include a mixture of enzymatic nucleic acid molecules (including different 
enzymatic nucleic acid molecule motifs), antisense and/or 2-5A chimera molecules to one or 
more targets to alleviate symptoms of a disease. 

Administration of Nucleic Acid Molecules 

30 Methods for the delivery of nucleic acid molecules are described in Akhtar et al, 1992, 

Trends Cell Bio., 2, 139; and Delivery Strategies for Antisense Oligonucleotide Therapeutics, 
ed. Akhtar, 1995, which are both incorporated herein by reference. Sullivan et al, PCT WO 
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94/02595, further describes the general methods for delivery of enzymatic RNA molecules. 
These protocols can be utilized for the delivery of virtually any nucleic acid molecule. 
Nucleic acid molecules can be administered to cells by a variety of methods known to those 
familiar to the art, including, but not restricted to, encapsulation in liposomes, by 
5 iontophoresis, or by incorporation into other vehicles, such as hydrogels, cyclodextrins, 
biodegradable nanocapsules, and bioadhesive microspheres. Alternatively, the nucleic 
acid/vehicle combination is locally delivered by direct injection or by use of an infusion 
pump. Other routes of delivery include, but are not limited to oral (tablet or pill form) and/or 
intrathecal delivery (Gold, 1997, Neuroscience, 76, 1153-1158). Other approaches include the 

1 0 use of various transport and carrier systems, for example though the use of conjugates and 
biodegradable polymers. For a comprehensive review on drug delivery strategies including 
CNS delivery, see Ho et al t 1999, Curr. Opin. Mol Ther., 1, 336-343 and Jain, Drug 
Delivery Systems: Technologies and Commercial Opportunities, Decision Resources, 1998 
and Groothuis et al, 1991, J. NeuroVirol, 3, 387-400. More detailed descriptions of nucleic 

15 acid delivery and administration are provided in Sullivan et al, supra, Draper et al, PCT 
W093/23569, Beigelman et al, PCT WO99/05094, and Klimuk et al, PCT WO99/04819, all 
of which have been incorporated by reference herein. 

The molecules of the instant invention can be used as pharmaceutical agents. 
Pharmaceutical agents prevent, inhibit the occurrence, or treat (alleviate a symptom to some 
20 extent, preferably all of the symptoms) of a disease state in a subject. 

The negatively charged polynucleotides of the invention can be administered {e.g., 
RNA, DNA or protein) and introduced into a subject by any standard means described herein 
and known in the art, with or without stabilizers, buffers, and the like, to form a 
pharmaceutical composition. When it is desired to use a liposome delivery mechanism, 
25 standard protocols for formation of liposomes can be followed. The compositions of the 
present invention can also be formulated and used as tablets, capsules or elixirs for oral 
administration; suppositories for rectal administration; sterile solutions; suspensions for 
injectable administration; and the other compositions known in the art. 

The present invention also includes pharmaceutically acceptable formulations of the 
30 compounds described. These formulations include salts of the above compounds, e.g., acid 
addition salts, for example, salts of hydrochloric, hydrobromic, acetic acid, and benzene 
sulfonic acid. 

A pharmacological composition or formulation refers to a composition or formulation 
in a form suitable for administration, e.g., systemic administration, into a cell or subject, 
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preferably a human. Suitable forms, in part, depend upon the use or the route of entry, for 
example oral, transdermal, or by injection. Such forms should not prevent the composition or 
formulation from reaching a target cell (/.&, a cell to which the negatively charged polymer is 
desired to be delivered to). For example, pharmacological compositions injected into the 
5 blood stream should be soluble. Other factors are known in the art, and include 
considerations such as toxicity and forms which prevent the composition or formulation from 
exerting its effect. 

By "systemic administration" is meant in vivo systemic absorption or accumulation of 
drugs in the blood stream followed by distribution throughout the entire body. 

10 Administration routes which lead to systemic absorption include, without limitations: 
intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary and 
intramuscular. Each of these administration routes expose the desired negatively charged 
polymers, e.g., nucleic acids, to an accessible diseased tissue. The rate of entry of a drug into 
the circulation has been shown to be a function of molecular weight or size. The use of a 

15 liposome or other drug carrier comprising the compounds of the instant invention can 
potentially localize the drug, for example, in certain tissue types, such as the tissues of the 
reticular endothelial system (RES). A liposome formulation that can facilitate the association 
of drug with the surface of cells, such as, lymphocytes and macrophages is also useful. This 
approach can provide enhanced delivery of the drug to target cells by taking advantage of the 

20 specificity of macrophage and lymphocyte immune recognition of abnormal cells, such as 
cancer cells. 

By pharmaceutically acceptable formulation is meant, a composition or formulation that 
allows for the effective distribution of the nucleic acid molecules of the instant invention in 
the physical location most suitable for their desired activity. Non-limiting examples of agents 

25 suitable for formulation with the nucleic acid molecules of the instant invention include: 
PEG conjugated nucleic acids, phospholipid conjugated nucleic acids, nucleic acids 
containing lipophilic moieties, phosphorothioates, P-glycoprotein inhibitors (such as Pluronic 
P85) which can enhance entry of drugs into various tissues, for exaple the CNS (Jolliet-Riant 
and Tillement, 1999, Fundam. Clin. Pharmacol., 13, 16-26); biodegradable polymers, such as 

30 poly (DL-lactide-coglycolide) microspheres for sustained release delivery after implantation 
(Emerich, DF et al, 1999, Cell Transplant, 8, 47-58) Alkermes, Inc. Cambridge, MA; and 
loaded nanoparticles, such as those made of polybutylcyanoacrylate, which can deliver drugs 
across the blood brain barrier and can alter neuronal uptake mechanisms (Prog 
Neuropsychopharmacol Biol Psychiatry, 23, 941-949, 1999). Other non-limiting examples of 

35 delivery strategies, including CNS delivery of the nucleic acid molecules of the instant 
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invention include material described in Boado et al, 1998, J. Pharm. Set, 87, 1308-1315; 
Tyler et al, 1999, FEB S Lett, 421, 280-284; Pardridge et al, 1995, PNAS USA., 92, 5592- 
5596; Boado, 1995, Adv. Drug Delivery Rev., 15, 73-107; Aldrian-Herrada et al, 1998, 
Nucleic Acids Res., 26, 4910-4916; and Tyler et al, 1999, PNAS USA., 96, 7053-7058. All 
5 these references are hereby incorporated herein by reference. 

The invention also features the use of the composition comprising surface-modified 
liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long-circulating 
liposomes or stealth liposomes). Nucleic acid molecules of the invention can also comprise 
covalently attached PEG molecules of various molecular weights. These formulations offer a 

1 0 method for increasing the accumulation of drugs in target tissues. This class of drug carriers 
resists opsonization and elimination by the mononuclear phagocytic system (MPS or RES), 
thereby enabling longer blood circulation times and enhanced tissue exposure for the 
encapsulated drug (Lasic et al. Chem. Rev. 1995, 95, 2601-2627; Ishiwata et al, Chem. 
Pharm. Bull. 1995, 43, 1005-1011). Such liposomes have been shown to accumulate 

1 5 selectively in tumors, presumably by extravasation and capture in the neovascularized target 
tissues (Lasic et al, Science 1995, 267, 1275-1276; Oku et a/., 1995, Biochim. Biophys. Acta, 
1238, 86-90). The long-circulating liposomes enhance the pharmacokinetics and 
pharmacodynamics of DNA and RNA, particularly compared to conventional cationic 
liposomes, which are known to accumulate in tissues of the MPS (Liu et al, J. Biol Chem. 

20 1995, 42, 24864-24870; Choi et al, International PCT Publication No. WO 96/10391; Ansell 
et al, International PCT Publication No. WO 96/10390; Holland et al, International PCT 
Publication No. WO 96/10392; all of which are incorporated by reference herein). Long- 
circulating liposomes are also likely to protect drugs from nuclease degradation to a greater 
extent compared to cationic liposomes, based on their ability to avoid accumulation in 

25 metabolically aggressive MPS tissues such as the liver and spleen. All of these references are 
incorporated by reference herein. 

The present invention also includes compositions prepared for storage or administration 
that include a pharmaceutically effective amount of the desired compounds in a 
pharmaceutically acceptable carrier or diluent. Acceptable carriers or diluents for therapeutic 
30 use are well known in the pharmaceutical art, and are described, for example, in Remington's 
Pharmaceutical Sciences, Mack Publishing Co. (A.R. Gennaro edit. 1985), hereby 
incorporated by reference herein. For example, preservatives, stabilizers, dyes and flavoring 
agents can be provided. These include sodium benzoate, sorbic acid and esters of p- 
hydroxybenzoic acid. In addition, antioxidants and suspending agents can be used. 
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A pharmaceutically effective dose is that dose required to prevent, inhibit the 
occurrence, or treat (alleviate a symptom to some extent, preferably all of the symptoms) of a 
disease state. The pharmaceutically effective dose depends on the type of disease, the 
composition used, the route of administration, the type of mammal being treated, the physical 
5 characteristics of the specific mammal under consideration, concurrent medication, and other 
factors which those skilled in the medical arts will recognize. Generally, an amount between 
0.1 rng/kg and 100 mg/kg body weight/day of active ingredients is administered dependent 
upon potency of the negatively charged polymer. 

The nucleic acid molecules of the invention and formulations thereof can be 
1 0 administered orally, topically, parenterally, by inhalation or spray, or rectally in dosage unit 
formulations containing conventional non-toxic pharmaceutically acceptable carriers, 
adjuvants and/or vehicles. The term parenteral as used herein includes percutaneous, 
subcutaneous, intravascular (e.g., intravenous), intramuscular, or intrathecal injection or 
infusion techniques and the like. In addition, there is provided a pharmaceutical formulation 
15 comprising a nucleic acid molecule of the invention and a pharmaceutically acceptable 
carrier. One or more nucleic acid molecules of the invention can be present in association 
with one or more non-toxic pharmaceutically acceptable carriers and/or diluents and/or 
adjuvants, and if desired other active ingredients. The pharmaceutical compositions 
containing nucleic acid molecules of the invention can be in a form suitable for oral use, for 
20 example, as tablets, troches, lozenges, aqueous or oily suspensions, dispersible powders or 
granules, emulsion, hard or soft capsules, or syrups or elixirs. 

Compositions intended for oral use can be prepared according to any method known to 
the art for the manufacture of pharmaceutical compositions and such compositions can 
contain one or more such sweetening agents, flavoring agents, coloring agents or preservative 

25 agents in order to provide pharmaceutically elegant and palatable preparations. Tablets 
contain the active ingredient in admixture with non-toxic pharmaceutically acceptable 
excipients that are suitable for the manufacture of tablets. These excipients can be, for 
example, inert diluents, such as calcium carbonate, sodium carbonate, lactose, calcium 
phosphate or sodium phosphate; granulating and disintegrating agents, for example, corn 

30 starch, or alginic acid; binding agents, for example starch, gelatin or acacia, and lubricating 
agents, for example magnesium stearate, stearic acid or talc. The tablets can be uncoated or 
they can be coated by known techniques. In some cases such coatings can be prepared by 
known techniques to delay disintegration and absorption in the gastrointestinal tract and 
thereby provide a sustained action over a longer period. For example, a time delay material 

35 such as glyceryl monosterate or glyceryl distearate can be employed. 
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Formulations for oral use can also be presented as hard gelatin capsules wherein the 
active ingredient is mixed with an inert solid diluent, for example, calcium carbonate, calcium 
phosphate or kaolin, or as soft gelatin capsules wherein the active ingredient is mixed with 
water or an oil medium, for example peanut oil, liquid paraffin or olive oil. 

5 Aqueous suspensions contain the active materials in admixture with excipients suitable 

for the manufacture of aqueous suspensions. Such excipients are suspending agents, for 
example, sodium carboxymethylcellulose, methylcellulose, hydropropyl-methylcellulose, 
sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia; dispersing or wetting 
agents can be a naturally-occurring phosphatide, for example, lecithin, or condensation 

1 0 products of an alkylene oxide with fatty acids, for example polyoxyethylene stearate, or 
condensation products of ethylene oxide with long chain aliphatic alcohols, for example 
heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products of ethylene oxide with partial esters derived from fatty acids and 

1 5 hexitol anhydrides, for example polyethylene sorbitan monooleate. The aqueous suspensions 
can also contain one or more preservatives, for example, ethyl, or n-propyl p- 
hydroxybenzoate, one or more coloring agents, one or more flavoring agents, and one or more 
sweetening agents, such as sucrose or saccharin. 

Oily suspensions can be formulated by suspending the active ingredients in a vegetable 
20 oil, for example arachis oil, olive oil, sesame oil or coconut oil, or in a mineral oil such as 
liquid paraffin. The oily suspensions can contain a thickening agent, for example beeswax, 
hard paraffin or cetyl alcohol. Sweetening agents and flavoring agents can be added to 
provide palatable oral preparations. These compositions can be preserved by the addition of 
an anti-oxidant such as ascorbic acid. 

25 Dispersible powders and granules suitable for preparation of an aqueous suspension by 

the addition of water provide the active ingredient in admixture with a dispersing or wetting 
agent, suspending agent and one or more preservatives. Suitable dispersing or wetting agents 
or suspending agents are exemplified by those already mentioned above. Additional 
excipients, for example sweetening, flavoring and coloring agents, can also be present. 

30 Pharmaceutical compositions of the invention can also be in the form of oil-in-water 

emulsions. The oily phase can be a vegetable oil or a mineral oil or mixtures of these. 
Suitable emulsifying agents can be naturally-occurring gums, for example gum acacia or gum 
tragacanth, naturally-occurring phosphatides, for example soy bean, lecithin, and esters or 
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partial esters derived from fatty acids and hexitol, anhydrides, for example, sorbitan 
monooleate, and condensation products of the said partial esters with ethylene oxide, for 
example polyoxyethylene sorbitan monooleate. The emulsions can also contain sweetening 
and flavoring agents. 

5 Syrups and elixirs can be formulated with sweetening agents, for example glycerol, 

propylene glycol, sorbitol, glucose or sucrose. Such formulations can also contain a 
demulcent, a preservative and flavoring and coloring agents. The pharmaceutical 
compositions can be in the form of a sterile injectable aqueous or oleaginous suspension. 
This suspension can be formulated according to the known art using those suitable dispersing 

10 or wetting agents and suspending agents that have been mentioned above. The sterile 
injectable preparation can also be a sterile injectable solution or suspension in a non-toxic 
parentally acceptable diluent or solvent, for example as a solution in 1,3-butanediol. Among 
the acceptable vehicles and solvents that can be employed are water, Ringer's solution and 
isotonic sodium chloride solution. In addition, sterile, fixed oils are conventionally employed 

15 as a solvent or suspending medium. For this purpose any bland fixed oil can be employed 
including synthetic mono-or diglycerides. In addition, fatty acids such as oleic acid find use 
in the preparation of injectables. 

The nucleic acid molecules of the invention can also be administered in the form of 
suppositories, e.g., for rectal administration of the drug. These compositions can be prepared 
20 by mixing the drug with a suitable non-irritating excipient that is solid at ordinary 
temperatures but liquid at the rectal temperature and will therefore melt in the rectum to 
release the drug. Such materials include cocoa butter and polyethylene glycols. 

Nucleic acid molecules of the invention can be administered parenterally in a sterile 
medium. The drug, depending on the vehicle and concentration used, can either be suspended 
25 or dissolved in the vehicle. Advantageously, adjuvants such as local anesthetics, 
preservatives and buffering agents can be dissolved in the vehicle. 

Dosage levels of the order of from about 0.1 mg to about 140 mg per kilogram of body 
weight per day are useful in the treatment of the above-indicated conditions (about 0.5 mg to 
about 7 g per patient or subject per day). The amount of active ingredient that can be 
30 combined with the carrier materials to produce a single dosage form varies depending upon 
the host treated and the particular mode of administration. Dosage unit forms generally 
contain between from about 1 mg to about 500 mg of an active ingredient. 
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It is understood that the specific dose level for any particular patient or subject depends 
upon a variety of factors including the activity of the specific compound employed, the age, 
body weight, general health, sex, diet, time of administration, route of administration, and rate 
of excretion, drug combination and the severity of the particular disease undergoing therapy. 

5 For administration to non-human animals, the composition can also be added to the 

animal feed or drinking water. It can be convenient to formulate the animal feed and drinking 
water compositions so that the animal takes in a therapeutically appropriate quantity of the 
composition along with its diet. It can also be convenient to present the composition as a 
premix for addition to the feed or drinking water. 

10 The nucleic acid molecules of the present invention can also be administered to a 

patient or subject in combination with other therapeutic compounds to increase the overall 
therapeutic effect. The use of multiple compounds to treat an indication can increase the 
beneficial effects while reducing the presence of side effects. 

In another aspect of the invention, nucleic acid molecules of the present invention are 
1 5 preferably expressed from transcription units (see for example Couture et al, 1996, TIG., 12, 
510, Skillern et al, International PCT Publication No. WO 00/22113, Conrad, International 
PCT Publication No. WO 00/22114, and Conrad, US 6,054,299) inserted into DNA or RNA 
vectors. The recombinant vectors are preferably DNA plasmids or viral vectors. Enzymatic 
nucleic acid expressing viral vectors can be constructed based on, but not limited to, adeno- 
20 associated virus, retrovirus, adenovirus, or alphavirus. Preferably, the recombinant vectors 
capable of expressing the nucleic acid molecules are delivered as described above, and persist 
in target cells. Alternatively, viral vectors can be used that provide for transient expression of 
nucleic acid molecules. Such vectors can be repeatedly administered as necessary. Once 
expressed, the nucleic acid molecule binds to the target mRNA. Delivery of nucleic acid 
25 molecule expressing vectors can be systemic, such as by intravenous or intra-muscular 
administration, by administration to target cells ex-planted from the subject followed by 
reintroduction into the subject, or by any other means that would allow for introduction into 
the desired target cell (for a review see Couture et al, 1996, TIG., 12, 510). 

One aspect of the invention features an expression vector comprising a nucleic acid 
30 sequence encoding at least one of the nucleic acid molecules of the instant invention. The 
nucleic acid sequence encoding the nucleic acid molecule of the instant invention is operably 
linked in a manner that allows expression of that nucleic acid molecule. 
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Another aspect the invention features an expression vector comprising nucleic acid 
sequence encoding at least one of the nucleic acid molecules of the invention, in a manner 
which allows expression of that nucleic acid molecule. The expression vector comprises in 
one embodiment; a) a transcription initiation region; b) a transcription termination region; c) 
5 a nucleic acid sequence encoding at least one said nucleic acid molecule; and wherein said 
sequence is operably linked to said initiation region and said termination region, in a manner 
that allows expression and/or delivery of said nucleic acid molecule. 

In another embodiment, the expression vector comprises: a) a transcription initiation 
region; b) a transcription termination region; c) an open reading frame; d) a nucleic acid 

1 0 sequence encoding at least one said nucleic acid molecule, wherein said sequence is operably 
linked to the 3 '-end of said open reading frame; and wherein said sequence is operably linked 
to said initiation region, said open reading frame and said termination region, in a manner 
which allows expression and/or delivery of said nucleic acid molecule. In yet another 
embodiment the expression vector comprises: a) a transcription initiation region; b) a 

1 5 transcription termination region; c) an intron; d) a nucleic acid sequence encoding at least one 
said nucleic acid molecule; and wherein said sequence is operably linked to said initiation 
region, said intron and said termination region, in a manner which allows expression and/or 
delivery of said nucleic acid molecule. 

In another embodiment, the expression vector comprises: a) a transcription initiation 
20 region; b) a transcription termination region; c) an intron; d) an open reading frame; e) a 
nucleic acid sequence encoding at least one said nucleic acid molecule, wherein said sequence 
is operably linked to the 3 '-end of said open reading frame; and wherein said sequence is 
operably linked to said initiation region, said intron, said open reading frame and said 
termination region, in a manner which allows expression and/or delivery of said nucleic acid 
25 molecule. 

Examples 

The following are non-limiting examples showing the selection, isolation, synthesis and 
activity of nucleic acids of the instant invention. 

Example 1: Identification of Potential Target Sites in Human Ras RNA 

30 The sequence of human Ras genes were screened for accessible sites using a computer- 

folding algorithm. Regions of the RNA that do not form secondary folding structures and 
contain potential enzymatic nucleic acid molecule and/or antisense binding/cleavage sites 
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were identified. The sequences of K-Ras and H-Ras binding/cleavage sites are shown in 
Tables II and III. 

Example 2: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human Ras RNA 

Enzymatic nucleic acid molecule target sites were chosen by analyzing sequences of 
5 Human K-Ras and H-Ras (for example, Genbank accession Nos: NM_004985 and 
NM_005343 respectively) and prioritizing the sites on the basis of folding. Enzymatic 
nucleic acid molecules were designed that can bind each target and were individually 
analyzed by computer folding (Christoffersen et al. t 1994 1 MoL Struc. Theochem, 311, 273; 
Jaeger et al, 1989, Proc. Natl. Acad. Sci USA, 86, 7706) to assess whether the enzymatic 
10 nucleic acid molecule sequences fold into the appropriate secondary structure. Those 
enzymatic nucleic acid molecules with unfavorable intramolecular interactions between the 
binding arms and the catalytic core are eliminated from consideration. As noted below, 
varying binding arm lengths can be chosen to optimize activity. Generally, at least 5 bases on 
each arm are able to bind to, or otherwise interact with, the target RNA. 

15 Example 3: Chemical Synthesis and Purification of Enzymatic Nucleic Acid Molecules for 
Efficient Cleavage and/or blocking of Ras RNA 

DNAzyme molecules are designed to anneal to various sites in the RNA message. The 
binding arms of the DNAzyme molecules are complementary to the target site sequences 
described above. The DNAzymes were chemically synthesized. The method of synthesis 

20 used followed the procedure for nucleic acid synthesis as described herein and in Usman et 
al, (1987 J. Am. Chem. Soc, 109, 7845), Scaringe et al, (1990 Nucleic Acids Res., 18, 
5433) and Wincott et al, supra, and made use of common nucleic acid protecting and 
coupling groups, such as dimethoxytrityl at the 5-end, and phosphoramidites at the 3 -end. 
The average stepwise coupling yields were typically >98%. The sequences of the chemically 

25 synthesized DNAzyme molecules used in this study are shown below in Tables II and ED. 

Example 4: DNAzyme Cleavage of Ras RNA Target in vitro 

DNAzymes targeted to the human K-Ras and H-Ras RNA are designed and synthesized 
as described above. These enzymatic nucleic acid molecules can be tested for cleavage 
activity in vitro, for example, using the following procedure. The target sequences and the 
30 nucleotide location within the K-Ras and H-Ras RNA are given in Tables II and III 
respectively. 
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Cleavage Reactions: 

DNAzymes and substrates were synthesized in 96-well format using 0.2pjmol scale. 
Substrates were 5'- 32 P labeled and gel purified using 7.5% polyacrylamide gels, and eluting 
into water. Assays were done by combining trace substrate with 500nM DNAzyme or 
5 greater, and initiated by adding final concentrations of 40mM Mg +2 , and 50mM Tris-Cl pH 
8.0. For each DNAzyme/substrate combination a control reaction was done to ensure 
cleavage was not the result of non-specific substrate degradation. A single three hour time 
point was taken and run on a 15% polyacrylamide gel to asses cleavage activity. Gels were 
dried and scanned using a Molecular Dynamics Phosphorimager and quantified using 
1 0 Molecular Dynamics ImageQuant software. Percent cleaved was determined by dividing 
values for cleaved substrate bands by full-length (uncleaved) values plus cleaved values and 
multiplying by 100 (%cleaved=[C/(U+C)]*100). 

Example 5: DNAzyme Cleavage of Ras RNA Target in vivo 
Cell Culture 

15 Wickstrom, 2001, Mol BiotechnoL, 18, 35-35, describes a cell culture system in which 

antisense oligonucleotides targeting H-Ras were studied in transformed mouse cells that form 
solid tumors. Treatment of cells with antisense targeting H-Ras resulted in the sequence 
specific and dose dependent inhibition of H-Ras expression. In this study, it was determined 
that antisense targeting the first intron region of H-Ras were more effective than antisense 

20 targeting the initiation codon region. 

Kita et ah, 1999, Int. J. Cancer, 80, 553-558, describes the growth inhibition of human 
pancreatic cancer cell lines by antisense oligonucleotides specific to mutated K-Ras genes. 
Antisense oligonucleotides were transfected to the transformed cells using liposomes. 
Cellular proliferation, K-Ras mRNA expression, and K-Ras protein synthesis were all 
25 evaluated as endpoints. Sato et al, 2000, Cancer Lett, 155, 153-161, describes another 
human pancreatic cancer cell line, HOR-Pl, that is characterized by high angiogenic activity 
and metastatic potential. Genetic and molecular analysis of this cell line revealed both 
increased telomerase activity and a mutation in the K-Ras oncogene. 

A variety of endpoints have been used in cell culture models to look at Ras-mediated 
30 effects after treatment with anti-Ras agents. Phenotypic endpoints include inhibition of cell 
proliferation, RNA expression, and reduction of Ras protein expression. Because Ras 
oncogene mutations are directly associated with increased proliferation of cetain tumor cells, 
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a proliferation endpoint for cell culture assays is preferably used as the primary screen. There 
are several methods by which this endpoint can be measured. Following treatment of cells 
with DNAzymes, cells are allowed to grow (typically 5 days) after which either the cell 
viability, the incorporation of [ 3 H] thymidine into cellular DNA and/or the cell density can be 
5 measured. The assay of cell density is done in a 96-well format using commercially available 
fluorescent nucleic acid stains (such as Syto® 13 or CyQuant®). As a secondary, 
confirmatory endpoint a DNAzyme-mediated decrease in the level of Ras protein expression 
is evaluated using a Ras-specific ELISA. 

Animal Models 

1 0 Evaluating the efficacy of anti-Ras agents in animal models is an important prerequisite 

to human clinical trials. As in cell culture models, the most Ras sensitive mouse tumor 
xenografts are those derived from cancer cells that express mutant Ras proteins. Nude mice 
bearing H-Ras transformed bladder cancer cell xenografts were sensitive to an anti-Ras 
antisense nucleic acid, resulting in an 80% inhibition of tumor growth after a 31 day treatment 

15 period (Wickstrom, 2001, Mol BiotechnoL, 18, 35-35). Zhang et al, 2000, Gene Ther., 7, 
2041, describes an anti-K-Ras ribozyme adenoviral vector (KRbz-ADV) targeting a K-Ras 
mutant (K-Ras codon 12 GGT to GTT; H441 and H1725 cells respectively). Non-small cell 
lung cancer cells (NSCLC H441 and HI 725 cells) that express the mutant K-Ras protein were 
used in nude mouse xenografts compared to NSCLC HI 650 cells that lack the relevant 

20 mutation. Pre-treatment with KRbz-ADV completely abrogated engraftment of both H441 
and HI 725 cells and compared to 100% engraftment and tumor growth in animals that 
received untreated tumor cells or a control vector. The above studies provide proof that 
inhibition of Ras expression by anti-Ras agents causes inhibition of tumor growth in animals. 
Anti-Ras DNAzymes chosen from in vitro assays are further tested in similar mouse 

25 xenograft models. Active DNAzymes are subsequently tested in combination with standard 
chemotherapies. 

Indications 

Particular degenerative and disease states that are associated with Ras expression 
modulation include but are not limited to cancer, for example lung cancer, colorectal cancer, 
30 bladder cancer, pancreatic cancer, breast cancer, prostate cancer and/or any other diseases or 
conditions that are related to or will respond to the levels of Ras in a cell or tissue, alone or in 
combination with other therapies. 
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The present body of knowledge in Ras research indicates the need for methods to assay 
Ras activity and for compounds that can regulate Ras expression for research, diagnostic, and 
therapeutic use. 

The use of monoclonal antibodies, chemotherapy, radiation therapy, and analgesics, are 
5 all non-limiting examples of methods that can be combined with or used in conjunction with 
the nucleic acid molecules (e.g. DNAzymes) of the instant invention. Common 
chemotherapies that can be combined with nucleic acid molecules of the instant invention 
include various combinations of cytotoxic drugs to kill cancer cells. These drugs include but 
are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, cyclophosphamide, 
10 doxorubin, fluorouracil carboplatin, edatrexate, gemcitabine, vinorelbine etc. Those skilled 
in the art will recognize that other drug compounds and therapies can be similarly be readily 
combined with the nucleic acid molecules of the instant invention (e.g. DNAzyme molecules) 
are hence within the scope of the instant invention. 

Diagnostic uses 

1 5 The nucleic acid molecules of this invention (e.g., enzymatic nucleic acid molecules) 

are used as diagnostic tools to examine genetic drift and mutations within diseased cells or to 
detect the presence of Ras RNA in a cell. The close relationship between enzymatic nucleic 
acid molecule activity and the structure of the target RNA allows the detection of mutations 
in any region of the molecule that alters the base-pairing and three-dimensional structure of 

20 the target RNA. Using multiple enzymatic nucleic acid molecules described in this invention, 
one maps nucleotide changes which are important to RNA structure and function in vitro, as 
well as in cells and tissues. Cleavage of target RNAs with enzymatic nucleic acid molecules 
are used to inhibit gene expression and define the role (essentially) of specified gene products 
in the progression of disease. In this manner, other genetic targets are defined as important 

25 mediators of the disease. These experiments lead to better treatment of the disease 
progression by affording the possibility of combinational therapies (e.g., multiple enzymatic 
nucleic acid molecules targeted to different genes, enzymatic nucleic acid molecules coupled 
with known small molecule inhibitors, or intermittent treatment with combinations of 
enzymatic nucleic acid molecules and/or other chemical or biological molecules). Other in 

30 vitro uses of enzymatic nucleic acid molecules of this invention are known in the art, and 
include detection of the presence of mRNAs associated with Ras-related condition. Such 
RNA is detected by determining the presence of a cleavage product after treatment with an 
enzymatic nucleic acid molecule using standard methodology. 
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ia a specific example, enzymatic nucleic acid molecules that cleave only wild-type or 
mutant forms of the target RNA are used for the assay. The first enzymatic nucleic acid 
molecule is used to identify wild-type RNA present in the sample and the second enzymatic 
nucleic acid molecule is used to identify mutant RNA in the sample. As reaction controls, 
5 synthetic substrates of both wild-type and mutant RNA are cleaved by both enzymatic nucleic 
acid molecules to demonstrate the relative enzymatic nucleic acid molecule efficiencies in the 
reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage 
products from the synthetic substrates also serve to generate size markers for the analysis of 
wild-type and mutant RNAs in the sample population. Thus each analysis requires two 

10 enzymatic nucleic acid molecules, two substrates and one unknown sample which is 
combined into six reactions. The presence of cleavage products is determined using an 
RNAse protection assay so that full-length and cleavage fragments of each RNA can be 
analyzed in one lane of a polyacrylamide gel. It is not absolutely required to quantify the 
results to gain insight into the expression of mutant RNAs and putative risk of the desired 

1 5 phenotypic changes in target cells. The expression of mRNA whose protein product is 
implicated in the development of the phenotype (i.e., Ras) is adequate to establish risk. If 
probes of comparable specific activity are used for both transcripts, then a qualitative 
comparison of RNA levels will be adequate and will decrease the cost of the initial diagnosis. 
Higher mutant form to wild-type ratios are correlated with higher risk whether RNA levels 

20 are compared qualitatively or quantitatively. The use of enzymatic nucleic acid molecules in 
diagnostic applications contemplated by the instant invention is described, for example, in 
George et al, US Patent Nos. 5,834,186 and 5,741,679, Shih et al, US Patent No. 5,589,332, 
Nathan et al, US Patent No 5,871,914, Nathan and Ellington, International PCT publication 
No. WO 00/24931, Breaker et al, International PCT Publication Nos. WO 00/26226 and 

25 98/27104, and Sullenger et al, International PCT publication No. WO 99/29842. 

Example 6: Identification of Potential Target Sites in Human HIV RNA 

The sequence of human HIV genes are screened for accessible sites using a computer- 
folding algorithm. Regions of the RNA that do not form secondary folding structures and 
contained potential enzymatic nucleic acid molecule and/or antisense binding/cleavage sites 
30 are identified. The sequences of these binding/cleavage sites are shown in Tables VI to XI. 

Example 6: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human HIV RNA 

Enzymatic nucleic acid molecule target sites were chosen by analyzing sequences of 
Human HIV (Genbank accession No: NM_005228) and prioritizing the sites on the basis of 
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folding. Enzymatic nucleic acid molecules were designed that can bind each target and are 
individually analyzed by computer folding (Christoffersen et al, 1994 J. Mol Struc. 
Tlieochem, 311, 273; Jaeger et al, 1989, Proc. Natl Acad. Set USA, 86, 7706) to assess 
whether the enzymatic nucleic acid molecule sequences fold into the appropriate secondary 
5 structure. Those enzymatic nucleic acid molecules with unfavorable intramolecular 
interactions between the binding arms and the catalytic core were eliminated from 
consideration. As noted below, varying binding arm lengths can be chosen to optimize 
activity. Generally, at least 5 bases on each arm are able to bind to, or otherwise interact 
with, the target RNA. 

10 Example 8: Chemical Synthesis and Purification of Ribozvmes and Antisense for Efficient 
Cleavage and/or blocking of HIV Activity 

Enzymatic nucleic acid molecules and antisense constructs are designed to anneal to 
various sites in the RNA message. The binding arms of the enzymatic nucleic acid molecules 
are complementary to the target site sequences described above, while the antisense 

15 constructs are fully complementary to the target site sequences described above. The 
enzymatic nucleic acid molecules and antisense constructs were chemically synthesized. The 
method of synthesis used followed the procedure for normal RNA synthesis as described 
above and in Usman et al, (1987 J. Am. Chem. Soc, 109, 7845), Scaringe et al, (1990 
Nucleic Acids Res., 18, 5433) and Wincott et al, supra, and made use of common nucleic 

20 acid protecting and coupling groups, such as dimethoxytrityl at the 5-end, and 
phosphoramidites at the 3-end. The average stepwise coupling yields were typically >98%. 

Enzymatic nucleic acid molecules and antisense constructs are also synthesized from 
DNA templates using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 1989, 
Methods Enzymol. 180, 51). Enzymatic nucleic acid molecules and antisense constructs are 

25 purified by gel electrophoresis using general methods or are purified by high pressure liquid 
chromatography (HPLC; See Wincott et al, supra; the totality of which is hereby 
incorporated herein by reference) and are resuspended in water. The sequences of the 
chemically synthesized enzymatic nucleic acid molecules used in this study are shown below 
in Table XI. The sequences of the chemically synthesized antisense constructs used in this 

30 study are complementary sequences to the Substrate sequences shown below as in Tables VI 
to XI. 

Example 8: Enzymatic nucleic acid molecule Cleavage of HIV RNA Target in vitro 
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Enzymatic nucleic acid molecules targeted to the human HP/ RNA are designed and 
synthesized as described above. These enzymatic nucleic acid molecules are tested for 
cleavage activity in vitro, for example, using the following procedure. The target sequences 
and the nucleotide location within the HIV RNA are given in Tables VI to XI. 

5 Cleavage Reactions: Full-length or partially full-length, internally-labeled target RNA 

for enzymatic nucleic acid molecule cleavage assay is prepared by in vitro transcription in the 
presence of [a- 32 p] CTP, passed over a G 50 Sephadex column by spin chromatography and 
used as substrate RNA without further purification. Alternately, substrates are 5 -32p-end 
labeled using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X 

1 0 concentration of purified enzymatic nucleic acid molecule in enzymatic nucleic acid molecule 
cleavage buffer (50 mM Tris-HCl, pH 7.5 at 37°C, 10 mM MgC^) and the cleavage reaction 
was initiated by adding the 2X enzymatic nucleic acid molecule mix to an equal volume of 
substrate RNA (maximum of 1-5 nM) that was also pre-warmed in cleavage buffer. As an 
initial screen, assays are carried out for 1 hour at 37°C using a final concentration of either 40 

15 nM or 1 mM enzymatic nucleic acid molecule, i.e., enzymatic nucleic acid molecule excess. 
The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM 
EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is heated 

o 

to 95 C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide gel. 
Substrate RNA and the specific RNA cleavage products generated by enzymatic nucleic acid 
20 molecule cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage 
is determined by Phosphor Imager® quantitation of bands representing the intact substrate 
and the cleavage products. 

Indications 

Particular degenerative and disease states that can be associated with HIV expression 
25 modulation include but are not limited to acquired immunodeficiency disease (AIDS) and 
related diseases and conditions, including but not limited to Kaposi's sarcoma, lymphoma, 
cervical cancer, squamous cell carcinoma, cardiac myopathy, rheumatic diseases, and 
opportunistic infection, for example Pneumocystis carinii, Cytomegalovirus, Herpes simplex, 
Mycobacteria, Cryptococcus, Toxoplasma, Progressive multifocal leucoencepalopathy 
30 (Papovavirus), Mycobacteria, Aspergillus, Cryptococcus, Candida, Cryptosporidium, Isospora 
belli, Microsporidia and any other diseases or conditions that are related to or will respond to 
the levels of HIV in a cell or tissue, alone or in combination with other therapies 
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The present body of knowledge in HIV research indicates the need for methods to assay 
HIV activity and for compounds that can regulate HIV expression for research, diagnostic, 
and therapeutic use. 

The use of antiviral compounds, monoclonal antibodies, chemotherapy, radiation 
5 therapy, analgesics, and/or anti-inflammatory compounds, are all non-limiting examples of a 
methods that can be combined with or used in conjunction with the nucleic acid molecules 
(e.g. ribozymes and antisense molecules) of the instant invention. Examples of antiviral 
compounds that can be used in conjunction with the nucleic acid molecules of the invention 
include but are not limited to AZT (also known as zidovudine or ZDV), ddC (zalcitabine), ddl 

10 (dideoxyinosine), d4T (stavudine), and 3TC (lamivudine) Ribavirin, delvaridine (Rescriptor), 
nevirapine (Viramune), efravirenz (Sustiva), ritonavir (Norvir), saquinivir (Invirase), 
indinavir (Crixivan), amprenivir (Agenerase), nelfinavir (Viracept), and/or lopinavir 
(Kaletra). Common chemotherapies that can be combined with nucleic acid molecules of the 
instant invention include various combinations of cytotoxic drugs to kill cancer cells. These 

1 5 drugs include but are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, gemcitabine, vinorelbine 
etc. Those skilled in the art will recognize that other drug compounds and therapies can be 
similarly be readily combined with the nucleic acid molecules of the instant invention (e.g. 
ribozymes and antisense molecules) are hence within the scope of the instant invention. 

10 Diagnostic uses 

The nucleic acid molecules of this invention {e.g., enzymatic nucleic acid molecules) 
are used as diagnostic tools to examine genetic drift and mutations within diseased cells or to 
detect the presence of HTV RNA in a cell. The close relationship between enzymatic nucleic 
acid molecule activity and the structure of the target RNA allows the detection of mutations 

25 in any region of the molecule which alters the base-pairing and three-dimensional structure of 
the target RNA. Using multiple enzymatic nucleic acid molecules described in this invention, 
one maps nucleotide changes which are important to RNA structure and function in vitro, as 
well as in cells and tissues. Cleavage of target RNAs with enzymatic nucleic acid molecules 
are used to inhibit gene expression and define the role (essentially) of specified gene products 

iO in the progression of disease. In this manner, other genetic targets are defined as important 
mediators of the disease. These experiments lead to better treatment of the disease 
progression by affording the possibility of combinational therapies (e.g., multiple enzymatic 
nucleic acid molecules targeted to different genes, enzymatic nucleic acid molecules coupled 
with known small molecule inhibitors, or intermittent treatment with combinations of 
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enzymatic nucleic acid molecules and/or other chemical or biological molecules). Other in 
vitro uses of enzymatic nucleic acid molecules of this invention are well known in the art, and 
include detection of the presence of mRNAs associated with HIV-related condition. Such 
RNA is detected by determining the presence of a cleavage product after treatment with an 
5 enzymatic nucleic acid molecule using standard methodology. 

In a specific example, enzymatic nucleic acid molecules which cleave only wild-type or 
mutant forms of the target RNA are used for the assay. The first enzymatic nucleic acid 
molecule is used to identify wild-type RNA present in the sample and the second enzymatic 
nucleic acid molecule is used to identify mutant RNA in the sample. As reaction controls, 

1 0 synthetic substrates of both wild-type and mutant RNA are cleaved by both enzymatic nucleic 
acid molecules to demonstrate the relative enzymatic nucleic acid molecule efficiencies in the 
reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage 
products from the synthetic substrates also serve to generate size markers for the analysis of 
wild-type and mutant RNAs in the sample population. Thus each analysis requires two 

15 enzymatic nucleic acid molecules, two substrates and one unknown sample which is 
combined into six reactions. The presence of cleavage products is determined using an 
RNAse protection assay so that full-length and cleavage fragments of each RNA can be 
analyzed in one lane of a polyacrylamide gel. It is not absolutely required to quantify the 
results to gain insight into the expression of mutant RNAs and putative risk of the desired 

20 phenotypic changes in target cells. The expression of mRNA whose protein product is 
implicated in the development of the phenotype (i.e., HIV) is adequate to establish risk. If 
probes of comparable specific activity are used for both transcripts, then a qualitative 
comparison of RNA levels will be adequate and will decrease the cost of the initial diagnosis. 
Higher mutant form to wild-type ratios are correlated with higher risk whether RNA levels 

25 are compared qualitatively or quantitatively. The use of enzymatic nucleic acid molecules in 
diagnostic applications contemplated by the instant invention is more fully described in 
George et aL, US Patent Nos. 5,834,186 and 5,741,679, Shih et aL, US Patent No. 5,589,332, 
Nathan et aL, US Patent No 5,871,914, Nathan and Ellington, International PCT publication 
No. WO 00/24931, Breaker et aL, International PCT Publication Nos. WO 00/26226 and 

30 98/27104, and Sullenger et aL, International PCT publication No. WO 99/29842. 

Example 10: Identification of Potential Target Sites in Human HER2 RNA 

The sequence of human HER2 genes were screened for accessible sites using a 
computer-folding algorithm. Regions of the RNA that do not form secondary folding 
structures and contained potential enzymatic nucleic acid molecule and/or antisense 
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binding/cleavage sites were identified. The sequences of these binding/cleavage sites are 
shown in Tables IV and V. 

Example 10: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human HER2 RNA 

Enzymatic nucleic acid molecule target sites were chosen by analyzing sequences of 
5 Human HER2 (Genbank accession No: X03363) and prioritizing the sites on the basis of 
folding. Enzymatic nucleic acid molecules were designed that can bind each target and are 
individually analyzed by computer folding (Christoffersen et al, 1994 7. Mol Struc. 
Theochem, 311, 273; Jaeger et al, 1989, Proa Natl. Acad. Sci. USA, 86, 7706) to assess 
whether the enzymatic nucleic acid molecule sequences fold into the appropriate secondary 
10 structure. Those enzymatic nucleic acid molecules with unfavorable intramolecular 
interactions between the binding arms and the catalytic core were eliminated from 
consideration. As noted below, variable binding arm lengths are chosen to optimize activity. 
Generally, at least 5 bases on each arm are able to bind to, or otherwise interact with, the 
target RNA. 

15 Example 12: Chemical Synthesis and Purification of Ribozymes and Antisense for Efficient 
Cleavage and/or Blocking of HER2 Expression 

DNAzyme molecules are designed to anneal to various sites in the RNA message. The 
binding arms of the DNAzyme molecules are complementary to the target site sequences 
described above. The DNAzymes were chemically synthesized. The method of synthesis 

20 used followed the procedure for nucleic acid synthesis as described above and in Usman et 
al, (1987 J. Am. Chem. Soc, 109, 7845), Scaringe et al, (1990 Nucleic Acids Res., 18, 
5433) and Wincott et al., supra, and made use of common nucleic acid protecting and 
coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. 
The average stepwise coupling yields were typically >98%. The sequences of the chemically 

25 synthesized DNAzyme molecules used in this study are shown below in Table V. 

Example 13: DNAzyme Cleavage of HER2 RNA Target in vitro 

DNAzymes targeted to the human HER2 RNA are designed and synthesized as 
described above. These enzymatic nucleic acid molecules can be tested for cleavage activity 
in vitro, for example, using the following procedure. The target sequences and the nucleotide 
30 location within the HER2 RNA are given in Tables IV and V. 
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Cleavage Reactions: 

Ribozymes and substrates were synthesized in 96-well format using 0.2]wmol scale. 
Substrates were 5'- 32 P labeled and gel purified using 7.5% polyacrylamide gels, and eluting 
into water. Assays were done by combining trace substrate with 500nM Ribozyme or greater, 
5 and initiated by adding final concentrations of 40mM Mg +2 , and 50mM Tris-Cl pH 8.0. For 
each ribozyme/substrate combination a control reaction was done to ensure cleavage was not 
the result of non-specific substrate degradation. A single three hour time point was taken and 
run on a 15% polyacrylamide gel to asses cleavage activity. Gels were dried and scanned 
using a Molecular Dynamics Phosphorimager and quantified using Molecular Dynamics 
10 ImageQuant software. Percent cleaved was determined by dividing values for cleaved 
substrate bands by full-length (uncleaved) values plus cleaved values and multiplying by 100 
(%cleaved-[C/(U+C)]*100). 

Example 14: DNAzvme Cleavage of HER2 RNA Target in vivo 
Cell Culture Review 

1 5 The greatest HER2 specific effects have been observed in cancer cell lines that express 

high levels of HER2 protein (as measured by ELIS A). Specifically, in one study that treated 
five human breast cancer cell lines with the HER2 antibody (anti-erbB2-sFv), the greatest 
inhibition of cell growth was seen in three cell lines (MDA-MB-361, SKBR-3 and BT-474) 
that express high levels of HER2 protein. No inhibition of cell growth was observed in two 

20 cell lines (MDA-MB-231 and MCF-7) that express low levels of HER2 protein (Wright, M., 
Grim, J., Deshane, J., Kim, M., Strong, T.V., Siegel, G.P., Curiel, D.T. (1997) An 
intracellular anti-erbB-2 single-chain antibody is specifically cytotoxic to human breast 
carcinoma cells overexpressing erbB-2. Gene Therapy 4: 317-322). Another group 
successfully used SKBR-3 cells to show HER2 antisense oligonucleotide-mediated inhibition 

25 of HER2 protein expression and HER2 RNA knockdown (Vaughn, J.P., Iglehart, J.D., 
Demirdji, S., Davis, P., Babiss, L.E., Caruthers, M.H., Marks, J.R. (1995) Antisense DNA 
downregulation of the ERBB2 oncogene measured by a flow cytometric assay. Proc Natl 
Acad Sci USA 92: 8338-8342). Other groups have also demonstrated a decrease in the levels 
of HER2 protein, HER2 mRNA and/or cell proliferation in cultured cells using anti-HER2 

30 DNAzymes or antisense molecules (Suzuki T., Curcio, L.D., Tsai, J. and Kashani-Sabet M. 
(1997) Anti-c-er&-B-2 Ribozyme for Breast Cancer. In Methods in Molecular Medicine, Vol. 
11, Therapeutic Applications of Ribozmes, Human Press, Inc., Totowa, NJ; Weichen, K., 
Zimmer, C. and Dietel, M. (1997) Selection of a high activity c-erbB-2 ribozyme using a 



WO 02/097114 



PCT/US02/16840 



73 

fusion gene of c-erbB-2 and the enhanced green fluorescent protein. Cancer Gene Therapy 5: 
45-51; Czubayko, F„ Downing, S.G., Hsieh, S.S., Goldstein, DJ., Lu P.Y., Trapnell, B.C. 
and Wellstein, A. (1997) Adenovirus-mediated transduction of ribozymes abrogates HER- 
2/neu and pleiotrophin expression and inhibits tumor cell proliferation. Gene Ther. 4: 943- 
5 949; Colomer, R., Lupu, R., Bacus, S.S. and Gelmann, E.P. (1994) erbB-2 antisense 
oligonucloetides inhibit the proliferation of breast carcinoma cells with erbB-2 oncogene 
amplification. British J. Cancer 70: 819-825; Betram et aL, 1994). Because cell lines that 
express higher levels of HER2 have been more sensitive to anti-HER2 agents, we prefer using 
several medium to high expressing cell lines, including SKBR-3 and T47D, for DNAzyme 
1 0 screens in cell culture. 

A variety of endpoints have been used in cell culture models to look at HER2-mediated 
effects after treatment with anti-HER2 agents. Phenotypic endpoints include inhibition of cell 
proliferation, apoptosis assays and reduction of HER2 protein expression. Because 
overexpression of HER2 is directly associated with increased proliferation of breast and 

1 5 ovarian tumor cells, a proliferation endpoint for cell culture assays will preferably be used as 
the primary screen. There are several methods by which this endpoint can be measured. 
Following treatment of cells with DNAzymes, cells are allowed to grow (typically 5 days) 
after which either the cell viability, the incorporation of [ 3 H] thymidine into cellular DNA 
and/or the cell density can be measured. The assay of cell density is very straightforward and 

20 can be done in a 96-well format using commercially available fluorescent nucleic acid stains 
(such as Syto® 13 or CyQuant®). The assay using CyQuant® is described herein and is 
currently being employed to screen -100 DNAzymes targeting HER2 (details below). 

As a secondary, confirmatory endpoint a DNAzyme-mediated decrease in the level of 
HER2 protein expression can be evaluated using a HER2-specific ELIS A. 

25 Validation of Cell Lines and DNAzyme Treatment Conditions 

Two human breast cancer cell lines (T47D and SKBR-3) that are known to express 
medium to high levels of HER2 protein, respectively, are considered for DNAzyme 
screening. In order to validate these cell lines for HER2-mediated sensitivity, both cell lines 
are treated with the HER2 specific antibody, Herceptin® (Genentech) and its effect on cell 
30 proliferation is determined. Herceptin® is added to cells at concentrations ranging from 0-8 
yiM in medium containing either no serum (OptiMem), 0.1% or 0.5% FBS and efficacy is 
determined via cell proliferation. Maximal inhibition of proliferation (-50%) in both cell 
lines is typically observed after addition of Herceptin® at 0.5 nM in medium containing 0.1% 
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or no FBS. The fact that both cell lines are sensitive to an anti-HER2 agent (Herceptin®) 
supports their use in experiments testing anti-HER2 DNAzymes. 

Prior to DNAzyme screening, the choice of the optimal lipid(s) and conditions for 
DNAzyme delivery is determined empirically for each cell line. Applicant has established a 
5 panel of cationic lipids (lipids as described in PCT application WO99/05094) that can be used 
to deliver DNAzymes to cultured cells and are very useful for cell proliferation assays that are 
typically 3-5 days in length. (Additional description of useful lipids is provided above, and 
those skilled in the art are also familiar with a variety of lipids that can be used for delivery of 
oligonucleotide to cells in culture.) Initially, this panel of lipid delivery vehicles is screened 
10 in SKBR-3 and T47D cells using previously established control oligonucleotides. Specific 
lipids and conditions for optimal delivery are selected for each cell line based on these 
screens. These conditions are used to deliver HER2 specific DNAzymes to cells for primary 
(inhibition of cell proliferation) and secondary (decrease in HER2 protein) efficacy endpoints. 

Primary Screen: Inhibition of Cell Proliferation 

15 DNAzyme screens are performed using an automated, high throughput 96-well cell 

proliferation assay. Cell proliferation is measured over a 5-day treatment period using the 
nucleic acid stain CyQuant® for determining cell density. The growth of cells treated with 
DNAzyme/lipid complexes is compared to both untreated cells and to cells treated with 
Scrambled-arm Attenuated core Controls. SACs can no longer bind to the target site due to 

20 the scrambled arm sequence and have nucleotide changes in the core that greatly diminish 
DNAzyme cleavage. These SACs are used to determine non-specific inhibition of cell 
growth caused by DNAzyme chemistry {i.e. multiple 2' O-Me modified nucleotides and a 3* 
inverted abasic). Lead DNAzymes are chosen from the primary screen based on their ability 
to inhibit cell proliferation in a specific manner. Dose response assays are carried out on 

25 these leads and a subset was advanced into a secondary screen using the level of HER2 
protein as an endpoint. 

Secondary Screen: Decrease in HER2 Protein and/or RNA 

A secondary screen that measures the effect of anti-HER2 DNAzymes on HER2 protein 
and/or RNA levels is used to affirm preliminary findings. A robust HER2 ELISA for both 
30 T47D and SKBR-3 cells has been established and is available for use as an additional 
endpoint. In addition, a real time RT-PCR assay (TaqMan assay) has been developed to 
assess HER2 RNA reduction compared to an actin RNA control. Dose response activity of 
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nucleic acid molecules of the instant invention can be used to assess both HER2 protein and 
RNA reduction endpoints. 

DNAzyme Mechanism Assays 

A TaqMan® assay for measuring the DNAzyme-mediated decrease in HER2 RNA has 
5 also been established. This assay is based on PCR technology and can measure in real time 
the production of HER2 iriRNA relative to a standard cellular mRNA such as GAPDH. This 
RNA assay is used to establish proof that lead DNAzymes are working through an RNA 
cleavage mechanism and result in a decrease in the level of HER2 mRNA, thus leading to a 
decrease in cell surface HER2 protein receptors and a subsequent decrease in tumor cell 
1 0 proliferation. 

Animal Models 

Evaluating the efficacy of anti-HER2 agents in animal models is an important 
prerequisite to human clinical trials. As in cell culture models, the most HER2 sensitive 
mouse tumor xenografts are those derived from human breast carcinoma cells that express 

1 5 high levels of HER2 protein. In a recent study, nude mice bearing BT-474 xenografts were 
sensitive to the anti-HER2 humanized monoclonal antibody Herceptin®, resulting in an 80% 
inhibition of tumor growth at a 1 mg kg dose (ip, 2 X week for 4-5 weeks). Tumor 
eradication was observed in 3 of 8 mice treated in this manner (Baselga, J., Norton, L. 
Albanell, J., Kim, Y.M. and Mendelsohn, J. (1998) Recombinant humanized anti-HER2 

20 antibody (Herceptin) enhances the antitumor activity of paclitaxel and doxorubicin against 
HER2/neu overexpressing human breast cancer xenografts. Cancer Res. 15: 2825-2831). 
This same study compared the efficacy of Herceptin® alone or in combination with the 
commonly used chemotherapeutics, paclitaxel or doxorubicin. Although, all three anti-HER2 
agents caused modest inhibition of tumor growth, the greatest antitumor activity was 

25 produced by the combination of Herceptin® and paclitaxel (93% inhibition of tumor growth 
vs 35% with paclitaxel alone). The above studies provide proof that inhibition of HER2 
expression by anti-HER2 agents causes inhibition of tumor growth in animals. Lead anti- 
HER2 DNAzymes chosen from in vitro assays are further tested in mouse xenograft models. 
DNAzymes are first tested alone and then in combination with standard chemotherapies. 

30 Animal Model Development 

Three human breast tumor cell lines (T47D, SKBR-3 and BT-474) were characterized 
to establish their growth curves in mice. These three cell lines have been implanted into the 
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mammary papillae of both nude and SCID mice and primary tumor volumes are measured 3 
times per week. Growth characteristics of these tumor lines using a Matrigel implantation 
format can also be established. The use of two other breast cell lines that have been 
engineered to express high levels of HER2 can also be used in the described studies. The 
5 tumor cell line(s) and implantation method that supports the most consistent and reliable 
tumor growth is used in animal studies testing the lead HER2 DNAzyme(s). DNAzymes are 
administered by daily subcutaneous injection or by continuous subcutaneous infusion from 
Alzet mini osmotic pumps beginning 3 days after tumor implantation and continuing for the 
duration of the study. Group sizes of at least 10 animals are employed. Efficacy is 
1 0 determined by statistical comparison of tumor volume of DNAzyme-treated animals to a 
control group of animals treated with saline alone. Because the growth of these tumors is 
generally slow (45-60 days), an initial endpoint is the time in days it takes to establish an 
easily measurable primary tumor (i.e. 50-100 mm 3 ) in the presence or absence of DNAzyme 
treatment. 

1 5 Clinical Summary 

Ovewiew 

Breast cancer is a common cancer in women and also occurs in men to a lesser degree. 
The incidence of breast cancer in the United States is -1 80,000 cases per year and -46,000 
die each year of the disease. In addition, 21,000 new cases of ovarian cancer per year lead to 

20 -13,000 deaths (data from Hung, M.-C, Matin, A., Zhang, Y., Xing, X., Sorgi, R, Huang, L. 
and Yu, D. (1995) HER-2/neu-targeting gene therapy - a review. Gene 159: 65-71 and the 
Surveillance, Epidemiology and End Results Program, NCI Surveillance, Epidemiology and 
End Results Program (SEER) Cancer Statistics Review: 
http://www.seer.ims.nci.nih.gov/Publications/CSR1973_1996/). Ovarian cancer is a potential 

25 secondary indication for anti-HER2 DNAzyme therapy. 

A full review of breast cancer is given in the NCI PDQ for Breast Cancer (NCI 
PDQ/Treatment/Health Professionals/Breast Cancer: 

http.7/cancemet.nci.nih.gov/clinpdq/soa/Breast_cancer_Physician.html; NCI 
PDQ/Treatment/Patients/Breast Cancer: 
30 http://cancernet.nci.nih.gov/clinpdq/pif/Breast cancer Patient.html) . A brief overview is 
given here. Breast cancer is evaluated or "staged" on the basis of tumor size, and whether it 
has spread to lymph nodes and/or other parts of the body. In Stage I breast cancer, the cancer 
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is no larger than 2 centimeters and has not spread outside of the breast. In Stage II, the 
patient's tumor is 2-5 centimeters but cancer may have spread to the axillary lymph nodes. 
By Stage HI, metastasis to the lymph nodes is typical, and tumors are > 5 centimeters. 
Additional tissue involvement (skin, chest wall, ribs, muscles etc.) may also be noted. Once 
5 cancer has spread to additional organs of the body, it is classed as Stage IV." 

Almost all breast cancers (>90%) are detected at Stage I or II, but 31% of these are 
already lymph node positive. The 5-year survival rate for node negative patients (with 
standard surgery/radiation/chemotherapy /hormone regimens) is 97%; however, involvement 
of the lymph nodes reduces the 5-year survival to only 77%. Involvement of other organs 
10 (>Stage EI) drastically reduces the overall survival, to 22% at 5 years. Thus, chance of 
recovery from breast cancer is highly dependent on early detection. Because up to 10% of 
breast cancers are hereditary, those with a family history are considered to be at high risk for 
breast cancer and should be monitored very closely. 

Therapy 

1 5 Breast cancer is highly treatable and often curable when detected in the early stages. 

(For a complete review of breast cancer treatments, see the NCI PDQ for Breast Cancer.) 
Common therapies include surgery, radiation therapy, chemotherapy and hormonal therapy. 
Depending upon many factors, including the tumor size, lymph node involvement and 
location of the lesion, surgical removal varies from lumpectomy (removal of the tumor and 

20 some surrounding tissue) to mastectomy (removal of the breast, lymph nodes and some or all 
of the underlying chest muscle). Even with successful surgical resection, as many as 21% of 
the patients may ultimately relapse (10-20 years). Thus, once local disease is controlled by 
surgery, adjuvant radiation treatments, chemotherapies and/or hormonal therapies are 
typically used to reduce the rate of recurrence and improve survival. The therapy regimen 

15 employed depends not only on the stage of the cancer at its time of removal, but other 
variables such the type of cancer (ductal or lobular), whether lymph nodes were involved and 
removed, age and general health of the patient and if other organs are involved. 

Common chemotherapies include various combinations of cytotoxic drugs to kill the 
cancer cells. These drugs include paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
30 cyclophosphamide, doxorubin, fluorouracil etc. Significant toxicities are associated with 
these cytotoxic therapies. Well-characterized toxicities include nausea and vomiting, 



WO 02/097114 



PCT/US02/16840 



78 

myelosuppression, alopecia and mucosity. Serious cardiac problems are also associated with 
certain of the combinations, e.g. doxorubin and paclitaxel, but are less common. 

Testing for estrogen and progesterone receptors helps to determine whether certain anti- 
hormone therapies migjbt be helpful in inhibiting tumor growth. If either or both receptors are 
5 present, therapies to interfere with the action of the hormone ligands, can be given in 
combination with chemotherapy and are generally continued for several years. These 
adjuvant therapies are called SERMs, selective estrogen receptor modulators, and they can 
give beneficial estrogen-like effects on bone and lipid metabolism while antagonizing 
estrogen in reproductive tissues. Tamoxifen is one such compound. The primary toxic effect 

1 0 associated with the use of tamoxifen is a 2 to 7-fold increase in the rate of endometrial cancer. 
Blood clots in the legs and lung and the possibility of stroke are additional side effects. 
However, tamoxifen has been determined to reduce breast cancer incidence by 49% in high- 
risk patients and an extensive, somewhat controversial, clinical study is underway to expand 
the prophylactic use of tamoxifen. Another SERM, raloxifene, was also shown to reduce the 

15 incidence of breast cancer in a large clinical trial where it was being used to treat 
osteoporosis. In additional studies, removal of the ovaries and/or drugs to keep the ovaries 
from working are being tested. 

Bone marrow transplantation is being studied in clinical trials for breast cancers that 
have become resistant to traditional chemotherapies or where >3 lymph nodes are involved. 
20 Marrow is removed from the patient prior to high-dose chemotherapy to protect it from being 
destroyed, and then replaced after the chemotherapy. Another type of "transplant" involves 
the exogenous treatment of peripheral blood stem cells with drugs to kill cancer cells prior to 
replacing the treated cells in the bloodstream. 

One biological treatment, a humanized monoclonal anti-HER2 antibody, Herceptin® 
25 (Genentech) has been approved by the FDA as an additional treatment for HER2 positive 
tumors. Herceptin® binds with high affinity to the extracellular domain of HER2 and thus 
blocks its signaling action. Herceptin® can be used alone or in combination with 
chemotherapeutics (i.e. paclitaxel, docetaxel, cisplatin, etc.) (Pegram, M.D., Lipton, A., 
Hayes, D.R, Weber, B.L., Baselga, J.M., Tripathy, D., Baly, D., Baughman, S.A., Twaddell, 
30 T., Glaspy, J.A. and Slamon, D.J. (1998) Phase II study of receptor-enhanced 
chemosensitivity using recombinant humanized anti-pl85HER2/neu monoclonal antibody 
plus cisplatin in patients with HER2/neu-overexpressing metastatic breast cancer refractory to 
chemotherapy treatment. J. Clin. Oncol. 16: 2659-2671). In Phase in studies, Herceptin® 
significantly improved the response rate to chemotherapy as well as improving the time to 
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progression (Ross, J.S. and Fletcher, J. A. (1998) The HER-2/neu oncogene in breast cancer: 
Prognostic factor, predictive factor and target for therapy. Oncologist 3: 1998). The most 
common side effects attributed to Herceptin® are fever and chills, pain, asthenia, nausea, 
vomiting, increased cough, diarrhea, headache, dyspnea, infection, rhinitis, and insomnia. 
5 Herceptin® in combination with chemotherapy (paclitaxel) can lead to cardiotoxicity 
(Sparano, J.A. (1999) Doxorubicin/taxane combinations: Cardiac toxicity and 
pharmacokinetics. Semin. Oncol. 26: 14-19), leukopenia, anemia, diarrhea, abdominal pain 
and infection. 

HER2 Protein Levels for Patient Screening and as a Potential Endpoint 

10 Because elevated HER2 levels can be detected in at least 30% of breast cancers, breast 

cancer patients can be pre-screened for elevated HER2 prior to admission to initial clinical 
trials testing an anti-HER2 DNAzyme. Initial HER2 levels can be determined (by ELISA) 
from tumor biopsies or resected tumor samples. 

During clinical trials, it may be possible to monitor circulating HER2 protein by ELISA 
15 (Ross and Fletcher, 1998). Evaluation of serial blood/serum samples over the course of the 
anti-HER2 DNAzyme treatment period could be useful in determining early indications of 
efficacy. In fact, the clinical course of Stage IV breast cancer was correlated with shed HER2 
protein fragment following a dose-intensified paclitaxel monotherapy. In all responders, the 
HER2 serum level decreased below the detection limit (Luftner, D., Schnabel. S. and 
20 Possinger, K. (1999) c-erbB-2 in serum of patients receiving fractionated paclitaxel 
chemotherapy. Int. J. Biol Markers 14: 55-59). 

Two cancer-associated antigens, CA27.29 and CA15.3, can also be measured in the 
serum. Both of these glycoproteins have been used as diagnostic markers for breast cancer. 
CA27.29 levels are higher than CA15.3 in breast cancer patients; the reverse is true in healthy 

25 individuals. Of these two markers, CA27.29 was found to better discriminate primary cancer 
from healthy subjects. In addition, a statistically significant and direct relationship was 
shown between CA27.29 and large vs small tumors and node postive vs node negative disease 
(Gion, M., Mione, R., Leon, A.E. and Dittadi, R. (1999) Comparison of the diagnostic 
accuracy of CA27.29 and CA15.3 in primary breast cancer. Clin. Chem. 45: 630-637). 

30 Moreover, both cancer antigens were found to be suitable for the detection of possible 
metastases during follow-up (Rodriguez de Paterna, L., Arnaiz, F., Estenoz, J. Ortuno, B. and 
Lanzos E. (1999) Study of serum tumor markers CEA, CA15.3, CA27.29 as diagnostic 
parameters in patients with breast carcinoma. Int. J. Biol. Markers 10: 24-29). Thus, 
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blocking breast tumor growth may be reflected in lower CA27.29 and/or CA15.3 levels 
compared to a control group. FDA submissions for the use of CA27.29 and CA15.3 for 
monitoring metastatic breast cancer patients have been filed (reviewed in Beveridge, R.A. 
(1999) Review of clinical studies of CA27.29 in breast cancer management. Int. J. Biol 
5 Markers 14: 36-39). Fully automated methods for measurement of either of these markers 
are commercially available. 

Indications 

Particular degenerative and disease states that can be associated with HER2 expression 
modulation include but are not limited to cancer, for example breast cancer and ovarian 
1 0 cancer and/or any other diseases or conditions that are related to or will respond to the levels 
of HER2 in a cell or tissue, alone or in combination with other therapies 

The present body of knowledge in HER2 research indicates the need for methods to 
assay HER2 activity and for compounds that can regulate HER2 expression for research, 
diagnostic, and therapeutic use. 

1 5 The use of monoclonal antibodies, chemotherapy, radiation therapy, and analgesics, are 

all non-limiting examples of methods that can be combined with or used in conjunction with 
the nucleic acid molecules {e.g. DNAzymes) of the instant invention. Common 
chemotherapies that can be combined with nucleic acid molecules of the instant invention 
include various combinations of cytotoxic drugs to kill cancer cells. These drugs include but 

10 are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, cyclophosphamide, 
doxorubin, fluorouracil carboplatin, edatrexate, gemcitabine, vinorelbine etc. Those skilled 
in the art will recognize that other drug compounds and therapies can be similarly be readily 
combined with the nucleic acid molecules of the instant invention (e.g. DNAzyme molecules) 
are hence within the scope of the instant invention. 

-5 Diagnostic uses 

The nucleic acid molecules of this invention (e.g., enzymatic nucleic acid molecules) 
can be used as diagnostic tools to examine genetic drift and mutations within diseased cells or 
to detect the presence of HER2 RNA in a cell. The close relationship between enzymatic 
nucleic acid molecule activity and the structure of the target RNA allows the detection of 
30 mutations in any region of the molecule that alters the base-pairing and three-dimensional 
structure of the target RNA. By using multiple enzymatic nucleic acid molecules described in 
this invention, one can map nucleotide changes which are important to RNA structure and 
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function in vitro, as well as in cells and tissues. Cleavage of target RNAs with enzymatic 
nucleic acid molecules can be used to inhibit gene expression and define the role (essentially) 
of specified gene products in the progression of disease. In this manner, other genetic targets 
can be defined as important mediators of the disease. These experiments can lead to better 
5 treatment of the disease progression by affording the possibility of combinational therapies 
{e.g., multiple enzymatic nucleic acid molecules targeted to different genes, enzymatic 
nucleic acid molecules coupled with known small molecule inhibitors, or intermittent 
treatment with combinations of enzymatic nucleic acid molecules and/or other chemical or 
biological molecules). Other in vitro uses of enzymatic nucleic acid molecules of this 
10 invention are well known in the art, and include detection of the presence of mRNAs 
associated with HER2-related condition. Such RNA is detected by determining the presence 
of a cleavage product after treatment with an enzymatic nucleic acid molecule using standard 
methodology. 

In a specific example, enzymatic nucleic acid molecules that cleave only wild-type or 

1 5 mutant forms of the target RNA are used for the assay. The first enzymatic nucleic acid 
molecule is used to identify wild-type RNA present in the sample and the second enzymatic 
nucleic acid molecule is used to identify mutant RNA in the sample. As reaction controls, 
synthetic substrates of both wild-type and mutant RNA are cleaved by both enzymatic nucleic 
acid molecules to demonstrate the relative enzymatic nucleic acid molecule efficiencies in the 

20 reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage 
products from the synthetic substrates also serve to generate size markers for the analysis of 
wild-type and mutant RNAs in the sample population. Thus each analysis requires two 
enzymatic nucleic acid molecules, two substrates and one unknown sample which is 
combined into six reactions. The presence of cleavage products is determined using an 

25 RNAse protection assay so that full-length and cleavage fragments of each RNA can be 
analyzed in one lane of a polyacrylamide gel. It is not absolutely required to quantify the 
results to gain insight into the expression of mutant RNAs and putative risk of the desired 
phenotypic changes in target cells. The expression of mRNA whose protein product is 
implicated in the development of the phenotype (i.e., HER2) is adequate to establish risk. If 

JO probes of comparable specific activity are used for both transcripts, then a qualitative 
comparison of RNA levels will be adequate and will decrease the cost of the initial diagnosis. 
Higher mutant form to wild-type ratios are correlated with higher risk whether RNA levels 
are compared qualitatively or quantitatively. The use of enzymatic nucleic acid molecules in 
diagnostic applications contemplated by the instant invention is more fully described in 

55 George etal, US Patent Nos. 5,834,186 and 5,741,679, Shih etal, US Patent No. 5,589,332, 
Nathan et al, US Patent No 5,871,914, Nathan and Ellington, International PCT publication 
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No. WO 00/24931, Breaker et al, International PCT Publication Nos. WO 00/26226 and 
98/27104, and Sullenger et al, International PCT publication No. WO 99/29842. 



Additional Uses 

5 Potential uses of sequence-specific enzymatic nucleic acid molecules of the instant 

invention can have many of the same applications for the study of RNA that DNA restriction 
endonucleases have for the study of DNA (Nathans et al, 1975 Ann. Rev. Biochem. 44:273). 
For example, the pattern of restriction fragments can be used to establish sequence 
relationships between two related RNAs, and large RNAs can be specifically cleaved to 
1 0 fragments of a size more useful for study. The ability to engineer sequence specificity of the 
enzymatic nucleic acid molecule is ideal for cleavage of RNAs of unknown sequence. 
Applicant has described the use of nucleic acid molecules to modulate gene expression of 
target genes in bacterial, microbial, fungal, viral, and eukaryotic systems including plant or 
mammalian cells. 

1 5 All patents and publications mentioned in the specification are indicative of the levels of 

skill of those skilled in the art to which the invention pertains. All references cited in this 
disclosure are incorporated by reference to the same extent as if each reference had been 
incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well adapted 
20 to carry out the objects and obtain the ends and advantages mentioned, as well as those 
inherent therein. The methods and compositions described herein as presently representative 
of preferred embodiments are exemplary and are not intended as limitations on the scope of 
the invention. Changes therein and other uses will occur to those skilled in the art, which are 
encompassed within the spirit of the invention, are defined by the scope of the claims. 

25 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications can be made to the invention disclosed herein without departing from the scope 
and spirit of the invention. Thus, such additional embodiments are within the scope of the 
present invention and the following claims. 

The invention illustratively described herein suitably can be practiced in the absence of 
30 any element or elements, limitation or limitations which is not specifically disclosed herein. 
Thus, for example, in each instance herein any of the terms "comprising", "consisting 
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essentially of" and "consisting of can be replaced with either of the other two terms. The 
terms and expressions that have been employed are used as terms of description and not of 
limitation, and there is no intention that in the use of such terms and expressions of excluding 
any equivalents of the features shown and described or portions thereof, but it is recognized 
5 that various modifications are possible within the scope of the invention claimed. Thus, it 
should be understood that although the present invention has been specifically disclosed by 
preferred embodiments, optional features, modification and variation of the concepts herein 
disclosed can be resorted to by those skilled in the art, and that such modifications and 
variations are considered to be within the scope of this invention as defined by the description 
1 0 and the appended claims. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize that 
the invention is also thereby described in terms of any individual member or subgroup of 
members of the Markush group or other group. 

1 5 Other embodiments are within the claims that follow. 
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Table I: 



Reagent 


Equivalents 


Amount 


Wait Time* DNA 


Wait Time* 2*-0-methyl 


Wait Time*RNA 














Phosphoramidites 


6.5 


163 uL 


45 sec 


2.5 min 


7.5 min 


S-Ethyl Tetrazole 


23.8 


238 pL 


45 sec 


2.5 min 


7.5 min 


Acetic Anhydride 


100 


233 pL 


5 sec 


5 sec 


5 sec 


N-Methyl 
Imidazole 


186 


233 pL 


5 sec 


5 sec 


5 sec 


TCA 


176 


2.3 mL 


21 sec 


21 sec 


21 sec 


iodine 


11.2 


1.7 mL 


45 sec 


45 sec 


45 sec 


Beaucage 


12.9 


645 pL 


100 sec 


300 sec 


300 sec 


Acetonitrile 


NA 


6.67 mL 


NA 


NA 


NA 



B. 0.2 ujnoj Synthesis Cycie ABI 394 Instrument 



Reagent 


Equivalents 


Amount 


Wait Time* DNA 


Wait Time* 2'-0-methyl 


Wait Time*RNA 














Phosphoramidites 


15 


31 pL 


45 sec 


233 sec 


465 sec 


S-Ethyl Tetrazole 


38.7 


31 pL 


45 sec 


233 min 


465 sec 


Acetic Anhydride 


655 


124 pL 


5 sec 


5 sec 


5 sec 


W-Methyl 
Imidazole 


1245 


124 pL 


5 sec 


5 sec 


5 sec 


TCA 


700 


732 pL 


10 sec 


10 sec 


10 sec 


Iodine 


20.6 


244 pL 


15 sec 


15 sec 


15 sec 


Beaucage 


7.7 


232 pL 


100 sec 


300 sec 


300 sec 


Acetonitrile 


NA 


2.64 mL 


NA 


NA 


NA 



C. 0.2 |jmol Synthesis Cycle 96 well Instrument 



Reagent 


Equivalents: DNA/ 
2'-0-methyl/Ribo 


Amount: DNA/2'-0- 
methyl/Ribo 


Wait Time* DNA 


Wait Time* 2'-0- 
methyl 


Wait Time* 
Ribo 














Phosphoramidites 


22733/66 


40/60/120 pL 


60 sec 


180 sec 


360sec 


S-Ethyl Tetrazole 


70/105/210 


40/60/120 pL 


60 sec 


180 min 


360 sec 


Acetic Anhydride 


265/265/265 


50/50/50 pL 


10 sec 


10 sec 


10 sec 


W-Methyl 
Imidazole 


502/502/502 


50/50/50 pL 


10 sec 


10 sec 


10 sec 


TCA 


238/475/475 


250/500/500 pL 


15 sec 


15 sec 


15 sec 


Iodine 


6.8/6.8/6.8 


80/80/80 pL 


30 sec 


30 sec 


30 sec 


Beaucage 


34/51/51 


80/120/120 


100 sec 


200 sec 


200 sec 


Acetonitrile 


NA 


1150/1150/1150 pL 


NA 


NA 


NA 



Wait time does not include contact time during delivery. 
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Table II: Human K-Ras DNAzyme and Substrate Sequence 



Pos 


Substrate 


Seq 
ID 


DNAzyme 


Seq 
ID 


10 


CCUAGGCG G CGGCCGCG 


1 


CGCGGCCG GGCTAGCTACAACGA CGCCTAGG 


2329 


13 


AGGCGGCG G CCGCGGCG 


2 


CGCCGCGG GGCTAGCTACAACGA CGCCGCCT 


2330 


16 


CGGCGGCC G CGGCGGCG 


3 


CGCCGCCG GGCTAGCTACAACGA GGCCGCCG 


2331 


19 


CGGCCGCG G CGGCGGAG 


4 


CTCCGCCG GGCTAGCTACAACGA CGCGGCCG 


2332 


22 


CCGCGGCG G CGGAGGCA 


5 


TGCCTCCG GGCTAGCTACAACGA CGCCGCGG 


2333 


28 


CGGCGGAG G CAGCAGCG 


6 


CGCTGCTG GGCTAGCTACAACGA CTCCGCCG 


2334 


31 


CGGAGGCA G CAGCGGCG 


7 


CGCCGCTG GGCTAGCTACAACGA TGCCTCCG 


2335 


34 


AGGCAGCA G CGGCGGCG 


8 


CGCCGCCG GGCTAGCTACAACGA TGCTGCCT 


2336 


37 


CAGCAGCG G CGGCGGCA 


9 


TGCCGCCG GGCTAGCTACAACGA CGCTGCTG 


2337 


40 


CAGCGGCG G CGGCAGUG 


10 


CACTGCCG GGCTAGCTACAACGA CGCCGCTG 


2338 


43 


CGGCGGCG G CAGUGGCG 


11 


CGCCACTG GGCTAGCTACAACGA CGCCGCCG 


2339 


46 


CGGCGGCA G UGGCGGCG 


12 


CGCCGCCA GGCTAGCTACAACGA TGCCGCCG 


2340 


49 


CGGCAGUG G CGGCGGCG 


13 


CGCCGCCG GGCTAGCTACAACGA CACTGCCG 


2341 


52 


CAGUGGCG G CGGCGAAG 


14 


CTTCGCCG GGCTAGCTACAACGA CGCCACTG 


2342 


55 


UGGCGGCG G CGAAGGUG 


15 


CACCTTCG GGCTAGCTACAACGA CGCCGCCA 


2343 


61 


CGGCGAAG G UGGCGGCG 


16 


CGCCGCCA GGCTAGCTACAACGA CTTCGCCG 


2344 


64 


CGAAGGUG G CGGCGGCU 


17 


AGCCGCCG GGCTAGCTACAACGA CACCTTCG 


2345 


67 


AGGUGGCG G CGGCUCGG 


18 


CCGAGCCG GGCTAGCTACAACGA CGCCACCT 


2346 


70 


UGGCGGCG G CUCGGCCA 


19 


TGGCCGAG GGCTAGCTACAACGA CGCCGCCA 


2347 


75 


GCGGCUCG G CCAGUACU 


20 


AGTACTGG GGCTAGCTACAACGA CGAGCCGC 


2348 


79 


CUCGGCCA G UACUCCCG 


21 


CGGGAGTA GGCTAGCTACAACGA TGGCCGAG 


2349 


81 


CGGCCAGU A CUCCCGGC 


22 


GCCGGGAG GGCTAGCTACAACGA ACTGGCCG 


2350 


88 


UACUCCCG G CCCCCGCC 


23 


GGCGGGGG GGCTAGCTACAACGA CGGGAGTA 


2351 


94 


CGGCCCCC G CCAUUUCG 


24 


CGAAATGG GGCTAGCTACAACGA GGGGGCCG 


2352 


97 


CCCCCGCC A UUUCGGAC 


25 


GTCCGAAA GGCTAGCTACAACGA GGCGGGGG 


2353 


104 


CAUUUCGG A CUGGGAGC 


26 


GCTCCCAG GGCTAGCTACAACGA CCGAAATG 


2354 


111 


GACUGGGA G CGAGCGCG 


27 


CGCGCTCG GGCTAGCTACAACGA TCCCAGTC 


2355 


115 


GGGAGCGA G CGCGGCGC 


28 


GCGCCGCG GGCTAGCTACAACGA TCGCTCCC 


2356 


117 


GAGCGAGC G CGGCGCAG 


29 


CTGCGCCG GGCTAGCTACAACGA GCTCGCTC 


2357 


120 


CGAGCGCG G CGCAGGCA 


30 


TGCCTGCG GGCTAGCTACAACGA CGCGCTCG 


2358 


122 


AGCGCGGC G CAGGCACU 


31 


AGTGCCTG GGCTAGCTACAACGA GCCGCGCT 


2359 


126 


CGGCGCAG G CACUGAAG 


32 


CTTCAGTG GGCTAGCTACAACGA CTGCGCCG 


2360 


128 


GCGCAGGC A CUGAAGGC 


33 


GCCTTCAG GGCTAGCTACAACGA GCCTGCGC 


2361 


135 


CACUGAAG G CGGCGGCG 


34 


CGCCGCCG GGCTAGCTACAACGA CTTCAGTG 


2362 


138 


UGAAGGCG G CGGCGGGG 


35 






141 


AGGCGGCG G CGGGGCCA 


36 


TGGCCCCG GGCTAGCTACAACGA CGCCGCCT 


2364 


146 


GCGGCGGG G CCAGAGGC 


37 


GCCTCTGG GGCTAGCTACAACGA CCCGCCGC 


2365 


153 


GGCCAGAG G CUCAGCGG 


38 


CCGCTGAG GGCTAGCTACAACGA CTCTGGCC 


2366 


158 


GAGGCUCA G CGGCUCCC 


39 


GGGAGCCG GGCTAGCTACAACGA TGAGCCTC 


2367 


161 


GCUCAGCG G CUCCCAGG 


40 


CCTGGGAG GGCTAGCTACAACGA CGCTGAGC 


2368 


169 


GCUCCCAG G UGCGGGAG 


41 


CTCCCGCA GGCTAGCTACAACGA CTGGGAGC 


2369 


171 


UCCCAGGU G CGGGAGAG 


42 


CTCTCCCG GGCTAGCTACAACGA ACCTGGGA 


2370 


182 


GGAGAGAG G CCUGCUGA 


43 


TCAGCAGG GGCTAGCTACAACGA CTCTCTCC 


2371 


186 


AGAGGCCU G CUGAAAAU 


44 


ATTTTCAG GGCTAGCTACAACGA AGGCCTCT 


2372 


193 


UGCUGAAA A UGACUGAA 


45 


TTCAGTCA GGCTAGCTACAACGA TTTCAGCA 


2373 


196 


UGAAAAUG A CUGAAUAU 


46 


ATATTCAG GGCTAGCTACAACGA CATTTTCA 


2374 


201 


AUGACUGA A UAUAAACU 


47 


AGTTTATA GGCTAGCTACAACGA TCAGTCAT 


2375 


203 


GACUGAAU A UAAACUUG 


48 


CAAGTTTA GGCTAGCTACAACGA ATTCAGTC 


2376 
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207 


GAAUAUAA A CUUGUGGU 


49 


ACCACAAG GGCTAGCTACAACGA TTATATTC 


2377 


211 


AUAAACUU G UGGUAGUU 


50 


AACTACCA GGCTAGCTACAACGA AAGTTTAT 


2378 


214 


AACUUGUG G UAGUUGGA 


51 


TCCAACTA GGCTAGCTACAACGA CACAAGTT 


2379 


217 


UUGUGGUA G UUGGAGCU 


52 


AGCTCCAA GGCTAGCTACAACGA TACCACAA 


2380 


223 


UAGUUGGA G CUUGUGGC 


53 


GCCACAAG GGCTAGCTACAACGA TCCAACTA 


2381 


227 


UGGAGCUU G UGGCGUAG 


54 


CTACGCCA GGCTAGCTACAACGA AAGCTCCA 


2382 


230 


AGCUUGUG G CGUAGGCA 


55 


TGCCTACG GGCTAGCTACAACGA CACAAGCT 


2383 


232 


CUUGUGGC G UAGGCAAG 


56 


CTTGCCTA GGCTAGCTACAACGA GCCACAAG 


2384 


236 


UGGCGUAG G CAAGAGUG 


57 


CACTCTTG GGCTAGCTACAACGA CTACGCCA 


2385 


242 


AGGCAAGA G UGCCUUGA 


58 


TCAAGGCA GGCTAGCTACAACGA TCTTGCCT 


2386 


244 


GCAAGAGU G CCUUGACG 


59 


CGTCAAGG GGCTAGCTACAACGA ACTCTTGC 


2387 


250 


GUGCCUUG A CGAUACAG 


60 


CTGTATCG GGCTAGCTACAACGA CAAGGCAC 


2388 


253 


CCUUGACG A UACAGCUA 


61 


TAGCTGTA GGCTAGCTACAACGA CGTCAAGG 


2389 


255 


UUGACGAU A CAGCUAAU 


62 


ATTAGCTG GGCTAGCTACAACGA ATCGTCAA 


2390 


258 


ACGAUACA G CUAAUUCA 


63 


TGAATTAG GGCTAGCTACAACGA TGTATCGT 


2391 


262 


UACAGCUA A UUCAGAAU 


64 


ATTCTGAA GGCTAGCTACAACGA TAGCTGTA 


2392 


269 


AAUUCAGA A UCAUUUUG 


65 


CAAAATGA GGCTAGCTACAACGA TCTGAATT 


2393 


272 


UCAGAAUC A UUUUGUGG 


66 


CCACAAAA GGCTAGCTACAACGA GATTCTGA 


2394 


277 


AUCAUUUU G UGGACGAA 


67 


TTCGTCCA GGCTAGCTACAACGA AAAATGAT 


2395 


281 


UUUUGUGG A CGAAUAUG 


68 


CATATTCG GGCTAGCTACAACGA CCACAAAA 


2396 


285 


GUGGACGA A UAUGAUCC 


69 


GGATCATA GGCTAGCTACAACGA TCGTCCAC 


2397 


287 


GGACGAAU A UGAUCCAA 


70 


TTGGATCA GGCTAGCTACAACGA ATTCGTCC 


2398 


290 


CGAAUAUG A UCCAACAA 


71 


TTGTTGGA GGCTAGCTACAACGA CATATTCG 


2399 


295 


AUGAUCCA A CAAUAGAG 


72 


CTCTATTG GGCTAG CTACAACG A TGGATCAT 


2400 


298 


AUCCAACA A UAGAGGAU 


73 


ATCCTCTA GGCTAGCTACAACGA TGTTGGAT 


2401 


305 


AAUAGAGG A UUCCUACA 


74 


TGTAGGAA GGCTAGCTACAACGA CCTCTATT 


2402 


311 


GGAUUCCU A CAGGAAGC 


75 


GCTTCCTG GGCTAGCTACAACGA AGGAATCC 


2403 


318 


UACAGGAA G CAAGUAGU 


76 


ACTACTTG GGCTAGCTACAACGA TTCCTGTA 


2404 


322 


GGAAGCAA G UAGUAAUU 


77 


AATTACTA GGCTAGCTACAACGA TTGCTTCC 


2405 


325 


AGCAAGUA G UAAUUGAU 


78 


ATCAATTA GGCTAGCTACAACGA TACTTGCT 


2406 


328 


AAGUAGUA A UUGAUGGA 


79 


TCCATCAA GGCTAGCTACAACGA TACTACTT 


2407 


332 


AGUAAUUG A UGGAGAAA 


80 


TTTCTCCA GGCTAGCTACAACGA CAATTACT 


2408 


340 


AUGGAGAA A CCUGUCUC 


81 


GAGACAGG GGCTAGCTACAACGA TTCTCCAT 


2409 


344 


AGAAACCU G UCUCUUGG 


82 


CCAAGAGA GGCTAGCTACAACGA AGGTTTCT 


2410 


353 


UCUCUUGG A UAUUCUCG 


83 


CGAGAATA GGCTAGCTACAACGA CCAAGAGA 


2411 


355 


UCUUGGAU A UUCUCGAC 


84 


GTCGAGAA GGCTAGCTACAACGA ATCCAAGA 


2412 


362 


UAUUCUCG A CACAGCAG 


85 


CTGCTGTG GGCTAGCTACAACGA CGAGAATA 


2413 


364 


UUCUCGAC A CAGCAGGU 


86 


ACCTGCTG GGCTAGCTACAACGA GTCGAGAA 


2414 


367 


UCGACACA G CAGGUCAA 


87 


TTGACCTG GGCTAGCTACAACGA TGTGTCGA 


2415 


371 


CACAGCAG G UCAAGAGG 


88 


CCTCTTGA GGCTAGCTACAACGA CTGCTGTG 


2416 


381 


CAAGAGGA G UACAGUGC 


89 


GCACTGTA GGCTAGCTACAACGA TCCTCTTG 


2417 


383 


AGAGGAGU A CAGUGCAA 


90 


TTGCACTG GGCTAGCTACAACGA ACTCCTCT 


2418 


386 


GGAGUACA G UGCAAUGA 


91 


TCATTGCA GGCTAGCTACAACGA TGTACTCC 


2419 


388 


AGUACAGU G CAAUGAGG 


92 


CCTCATTG GGCTAGCTACAACGA ACTGTACT 


2420 


391 


ACAGUGCA A UGAGGGAC 


93 


GTCCCTCA GGCTAGCTACAACGA TGCACTGT 


2421 


398 


AAUGAGGG A CCAGUACA 


94 


TGTACTGG GGCTAGCTACAACGA CCCTCATT 


2422 


402 


AGGGACCA G UACAUGAG 


95 


CTCATGTA GGCTAGCTACAACGA TGGTCCCT 


2423 


404 


GGACCAGU A CAUGAGGA 


96 


TCCTCATG GGCTAGCTACAACGA ACTGGTCC 


2424 


406 


ACCAGUAC A UGAGGACU 


97 


AGTCCTCA GGCTAGCTACAACGA GTACTGGT 


2425 


412 


ACAUGAGG A CUGGGGAG 


98 


CTCCCCAG GGCTAGCTACAACGA CCTCATGT 


2426 


422 


UGGGGAGG G CUUUCUUU 


99 


AAAGAAAG GGCTAGCTACAACGA CCTCCCCA 


2427 


431 


CUUUCUUU G UGUAUUUG 


100 


CAAATACA GGCTAGCTACAACGA AAAGAAAG 


2428 
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433 


UUCUUUGU G UAUUUGCC 


101 


GGCAAATA GG CTAG CTAC AACGA ACAAAGAA 


2429 


435 


CUUUGUGU A UUUGCCAU 


102 


ATGGCAAA GGCTAG CTAC AACGA ACACAAAG 


2430 


439 


GUGUAUUU G CCAUAAAU 


103 


ATTTATGG GGCTAG CTAC AACGA AAATACAC 


2431 


442 


UAUUUGCC A UAAAUAAU 


104 


ATTATTTA GGCTAGCTACAACGA GGCAAATA 


2432 


446 


UGCCAUAA A UAAUACUA 


105 


TAGTATTA GGCTAGCTACAACGA TTATGGCA 


2433 


449 


CAUAAAUA A UACUAAAU 


106 


ATTTAGTA GGCTAGCTACAACGA TATTTATG 


2434 


451 


UAAAUAAU A CUAAAUCA 


107 


TGATTTAG GGCTAGCTACAACGA ATTATTTA 


2435 


456 


AAUACUAA A UCAUUUGA 


108 


TCAAATGA GGCTAGCTACAACGA TTAGTATT 


2436 


459 


ACUAAAUC A UUUGAAGA 


109 


TCTTCAAA GGCTAGCTACAACGA GATTTAGT 


2437 


467 


AUUUGAAG A UAUUCACC 


110 


GGTGAATA GGCTAGCTACAACGA CTTCAAAT 


2438 


469 


UUGAAGAU A UUCACCAU 


111 


ATGGTGAA GGCTAGCTACAACGA ATCTTCAA 


2439 


473 


AGAUAUUC A CCAUUAUA 


112 


TATAATGG GGCTAGCTACAACGA GAATATCT 


2440 


476 


UAUUCACC A UUAUAGAG 


113 


CTCTATAA GGCTAGCTACAACGA GGTGAATA 


2441 


479 


UCACCAUU A UAGAGAAC 


114 


GTTCTCTA GGCTAGCTACAACGA AATGGTGA 


2442 


486 


UAUAGAGA A CAAAUUAA 


115 


TTAATTTG GGCTAGCTACAACGA TCTCTATA 


2443 


490 


GAGAACAA A UUAAAAGA 


116 


TCTTTTAA GGCTAGCTACAACGA TTGTTCTC 


2444 


499 


UUAAAAGA G UUAAGGAC 


117 


GTCCTTAA GGCTAGCTACAACGA TCTTTTAA 


2445 


506 


AGUUAAGG A CUCUGAAG 


118 


CTTCAGAG GGCTAGCTACAACGA CCTTAACT 


2446 


515 


CUCUGAAG A UGUACCUA 


119 


TAGGTACA GGCTAGCTACAACGA CTTCAGAG 


2447 


517 


CUGAAGAU G UACCUAUG 


120 


CATAGGTA GGCTAGCTACAACGA ATCTTCAG 


2448 


519 


GAAGAUGU A CCUAUGGU 


121 


ACCATAGG GGCTAGCTACAACGA ACATCTTC 


2449 


523 


AUGUACCU A UGGUCCUA 


122 


TAGGACCA GGCTAGCTACAACGA AGGTACAT 


2450 


526 


UACCUAUG G UCCUAGUA 


123 


TACTAGGA GGCTAGCTACAACGA CATAGGTA 


2451 


532 


UGGUCCUA G UAGGAAAU 


124 


ATTTCCTA GGCTAGCTACAACGA TAGGACCA 


2452 


539 


AGUAGGAA A UAAAUGUG 


125 


CACATTTA GGCTAGCTACAACGA TTCCTACT 


2453 


543 


GGAAAUAA A UGUGAUUU 


126 


AAATCACA GGCTAGCTACAACGA TTATTTCC 


2454 


545 


AAAUAAAU G UGAUUUGC 


127 


GCAAATCA GGCTAGCTACAACGA ATTTATTT 


2455 


548 


UAAAUGUG A UUUGCCUU 


128 


AAGGCAAA GGCTAGCTACAACGA CACATTTA 


2456 


552 


UGUGAUUU G CCUUCUAG 


129 


! CTAGAAGG GGCTAGCTACAACGA AAATCACA 


2457 


562 


CUUCUAGA A CAGUAGAC 


130 


GTCTACTG GGCTAGCTACAACGA TCTAGAAG 


2458 


565 


CUAGAACA G UAGACACA 


131 


TGTGTCTA GGCTAGCTACAACGA TGTTCTAG 


2459 


569 


AACAGUAG A CACAAAAC 


132 


GTTTTGTG GGCTAGCTACAACGA CTACTGTT 


2460 


571 


CAGUAGAC A CAAAACAG 


133 


CTGTTTTG GGCTAGCTACAACGA GTCTACTG 


2461 


576 


GACACAAA A CAGGCUCA 


134 


TGAGCCTG GGCTAGCTACAACGA TTTGTGTC 


2462 


580 


CAAAACAG G CUCAGGAC 


135 


GTCCTGAG GGCTAGCTACAACGA CTGTTTTG 


2463 


587 


GGCUCAGG A CUUAGCAA 


136 


TTGCTAAG GGCTAGCTACAACGA CCTGAGCC 


2464 


592 


AGGACUUA G CAAGAAGU 


137 


ACTTCTTG GGCTAGCTACAACGA TAAGTCCT 


2465 


599 


AGCAAGAA G UUAUGGAA 


138 


TTCCATAA GGCTAGCTACAACGA TTCTTGCT 


2466 


602 


AAGAAGUU A UGGAAUUC 


139 


GAATTCCA GGCTAGCTACAACGA AACTTCTT 


2467 


607 


GUUAUGGA A UUCCUUUU 


140 


AAAAGGAA GGCTAGCTACAACGA TCCATAAC 


2468 


616 


UUCCUUUU A UUGAAACA 


141 


TGTTTCAA GGCTAGCTACAACGA AAAAGGAA 


2469 


622 


UUAUUGAA A CAUCAGCA 


142 


TGCTGATG GGCTAGCTACAACGA TTCAATAA 


2470 


624 


AUUGAAAC A UCAGCAAA 


143 


TTTGCTGA GGCTAGCTACAACGA GTTTCAAT 


2471 


628 


AAACAUCA G CAAAGACA 


144 


TGTCTTTG GGCTAGCTACAACGA TGATGTTT 


2472 


634 


CAGCAAAG A CAAGACAG 


145 


CTGTCTTG GGCTAGCTACAACGA CTTTGCTG 


2473 


639 


AAGACAAG A CAGGGUGU 


146 


ACACCCTG GGCTAGCTACAACGA CTTGTCTT 


2474 


i 644 


AAGACAGG G UGUUGAUG 


147 


CATCAACA GGCTAGCTACAACGA CCTGTCTT 


2475 


646 


GACAGGGU G UUGAUGAU 


148 


ATCATCAA GGCTAGCTACAACGA ACCCTGTC 


2476 


650 


GGGUGUUG A UGAUGCCU 


149 


AGGCATCA GGCTAGCTACAACGA CAACACCC 


2477 


653 


UGUUGAUG A UGCCUUCU 


150 


AGAAGGCA GGCTAGCTACAACGA CATCAACA 


2478 


655 


UUGAUGAU G CCUUCUAU 


151 


ATAGAAGG GGCTAGCTACAACGA ATCATCAA 


2479 


662 


UGCCUUCU A UACAUUAG 


152 


CTAATGTA GGCTAGCTACAACGA AGAAGGCA 


2480 
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664 


CCUUCUAU A CAUUAGUU 


153 


AACTAATG GGCTAGCTACAACGA ATAGAAGG 


2481 


666 


UUCUAUAC A UUAGUUCG 


154 


CGAACTAA GGCTAGCTACAACGA GTATAGAA 


2482 


670 


AUACAUUA G UUCGAGAA 


155 


TTCTCGAA GGCTAGCTACAACGA TAATGTAT 


2483 


679 


UUCGAGAA A UUCGAAAA 


156 


TTTTCGAA GGCTAGCTACAACGA TTCTCGAA 


2484 


687 


AUUCGAAA A CAUAAAGA 


157 


TCTTTATG GGCTAGCTACAACGA TTTCGAAT 


2485 


689 


UCGAAAAC A UAAAGAAA 


158 


TTTCTTTA GGCTAGCTACAACGA GTTTTCGA 


2486 


700 


AAGAAAAG A UGAGCAAA 


159 


TTTGCTCA GGCTAGCTACAACGA CTTTTCTT 


2487 


704 


AAAGAUGA G CAAAGAUG 


160 


CATCTTTG GGCTAGCTACAACGA TCATCTTT 


2488 


710 


GAGCAAAG A UGGUAAAA 


161 


TTTTACCA GGCTAGCTACAACGA CTTTGCTC 


2489 


713 


CAAAGAUG G UAAAAAGA 


162 


TCTTTTTA GGCTAGCTACAACGA CATCTTTG 


2490 


732 


AAAAAGAA G UCAAAGAC 


163 


GTCTTTGA GGCTAGCTACAACGA TTCTTTTT 


2491 


739 


AGUCAAAG A CAAAGUGU 


164 


ACACTTTG GGCTAGCTACAACGA CTTTGACT 


2492 


744 


AAGACAAA G UGUGUAAU 


165 


ATTACACA GGCTAGCTACAACGA TTTGTCTT 


2493 


746 


GACAAAGU G UGUAAUUA 


166 


TAATTACA GGCTAGCTACAACGA ACTTTGTC 


2494 


748 


CAAAGUGU G UAAUUAUG 


167 


CATAATTA GGCTAGCTACAACGA ACACTTTG 


2495 


751 


AGUGUGUA A UUAUGUAA 


168 


TTACATAA GGCTAGCTACAACGA TACACACT 


2496 


754 


GUGUAAUU A UGUAAAUA 


169 


T ATT T AC A GGCTAGCTACAACGA AATTACAC 


2497 


756 


GUAAUUAU G UAAAUACA 


170 


TGTATTTA GGCTAGCTACAACGA ATAATTAC 


2498 


760 


UUAUGUAA A UACAAUUU 


171 


AAATTGTA GGCTAGCTACAACGA TTACATAA 


2499 


762 


AUGUAAAU A CAAUUUGU 


172 


ACAAATTG GGCTAGCTACAACGA ATTTACAT 


2500 


765 


UAAAUACA A UUUGUACU 


173 


AGTACAAA GGCTAGCTACAACGA TGTATTTA 


2501 


769 


UACAAUUU G UACUUUUU 


174 


AAAAAGTA GGCTAGCTACAACGA AAATTGTA 


2502 


771 


CAAUUUGU A CUUUUUUC 


175 


GAAAAAAG GGCTAGCTACAACGA ACAAATTG 


2503 


785 


UUCUUAAG G CAUACUAG 


176 


CTAGTATG GGCTAGCTACAACGA CTTAAGAA 


2504 


787 


CUUAAGGC A UACUAGUA 


177 


TACTAGTA GGCTAGCTACAACGA GCCTTAAG 


2505 


789 


UAAGGCAU A CUAGUACA 


178 


TGTACTAG GGCTAGCTACAACGA ATGCCTTA 


2506 


793 


GCAUACUA G UACAAGUG 


179 


CACTTGTA GGCTAGCTACAACGA TAGTATGC 


2507 


795 


AUACUAGU A CAAGUGGU 


180 


ACCACTTG GGCTAGCTACAACGA ACTAGTAT 


2508 


799 


UAGUACAA G UGGUAAUU 


181 


AATTACCA GGCTAGCTACAACGA TTGTACTA 


2509 ; 


802 


UACAAGUG G UAAUUUUU 


182 


AAAAATTA GGCTAGCTACAACGA CACTTGTA 


2510 


805 


AAGUGGUA A UUUUUGUA 


183 


TACAAAAA GGCTAGCTACAACGA TACCACTT 


2511 


811 


UAAUUUUU G UACAUUAC 


184 


GTAATGTA GGCTAGCTACAACGA AAAAATTA 


2512 


813 


AUUUUUGU A CAUUACAC 


185 


GTGTAATG GGCTAGCTACAACGA ACAAAAAT 


2513 


815 


UUUUGUAC A UUACACUA 


186 


TAGTGTAA GGCTAGCTACAACGA GTACAAAA 


2514 


818 


UGUACAUU A CACUAAAU 


187 


ATTTAGTG GGCTAGCTACAACGA AATGTACA 


2515 


820 


UACAUUAC A CUAAAUUA 


188 


TAATTTAG GGCTAGCTACAACGA GTAATGTA 


2516 


825 


UACACUAA A UUAUUAGC 


189 


GCTAATAA GGCTAGCTACAACGA TTAGTGTA 


2517 


828 


ACUAAAUU A UUAGCAUU 


190 


AATGCTAA GGCTAGCTACAACGA AATTTAGT 


2518 


832 


AAUUAUUA G CAUUUGUU 


191 


AACAAATG GGCTAGCTACAACGA TAATAATT 


2519 


834 


UUAUUAGC A UUUGUUUU 


192 


AAAACAAA GGCTAGCTACAACGA GCTAATAA 


2520 


838 


UAGCAUUU G UUUUAGCA 


193 


TGCTAAAA GGCTAGCTACAACGA AAATGCTA 


2521 


844 


UUGUUUUA G CAUUACCU 


194 


AGGTAATG GGCTAGCTACAACGA TAAAACAA 


2522 


846 


GUUUUAGC A UUACCUAA 


195 


TTAGGTAA GGCTAGCTACAACGA GCTAAAAC 


2523 


849 


UUAGCAUU A CCUAAUUU 


196 


AAATTAGG GGCTAGCTACAACGA AATGCTAA 


2524 


854 


AUUACCUA A UUUUUUUC 


197 


GAAAAAAA GGCTAGCTACAACGA TAGGTAAT 


2525 


865 


UUUUUCCU G CUCCAUGC 


198 


GCATGGAG GGCTAGCTACAACGA AGGAAAAA 


2526 


870 


CCUGCUCC A UGCAGACU 


199 


AGTCTGCA GGCTAGCTACAACGA GGAGCAGG 


2527 


872 


UGCUCCAU G CAGACUGU 


200 


ACAGTCTG GGCTAGCTACAACGA ATGGAGCA 


2528 


876 


CCAUGCAG A CUGUUAGC 


201 


GCTAACAG GGCTAGCTACAACGA CTGCATGG 


2529 


879 


UGCAGACU G UUAGCUUU 


202 


AAAGCTAA GGCTAGCTACAACGA AGTCTGCA 


2530 


883 


GACUGUUA G CUUUUACC 


203 


GGTAAAAG GGCTAGCTACAACGA TAACAGTC 


2531 


889 


UAGCUUUU A CCUUAAAU 


204 


ATTTAAGG GGCTAGCTACAACGA AAAAGCTA 


2532 
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896 


UACCUUAA A UGCUUAUU 


205 


AATAAGCA GG CTAG CTACAACG A TTAAGGTA 


2533 


898 


CCUUAAAU G CUUAUUUU 


206 


AAAATAAG GG CTAG CTACAACG A ATTTAAGG 


2534 


902 


AAAUGCUU A UUUUAAAA 


207 


TTTTAAAA GGCTAG CTACAACG A AAGCATTT 


2535 


910 


AUUUUAAA A UGACAGUG 


208 


CACTGTCA GGCTAG CT ACAACGA TTTAAAAT 


2536 


913 


UUAAAAUG A CAGUGGAA 


209 


TTCCACTG GGCTAG CT ACAACGA CATTTTAA 


2537 


916 


AAAUGACA G UGGAAGUU 


210 


AACTTCCA GGCTAGCTACAACGA TGTCATTT 


2538 


922 


CAGUGGAA G UUUUUUUU 


211 


AAAAAAAA GGCTAGCTACAACGA TTCCACTG 


2539 


i 939 


UCCUCGAA G UGCCAGUA 


212 


TACTGGCA GGCTAGCTACAACGA TTCGAGGA 


2540 


941 


CUCGAAGU G CCAGUAUU 


213 


AATACTGG GGCTAGCTACAACGA ACTTCGAG 


2541 


945 


AAGUGCCA G UAUUCCCA 


214 


TGGGAATA GGCTAGCTACAACGA TGGCACTT 


2542 


947 


GUGCCAGU A UUCCCAGA 


. 215 


TCTGGGAA GGCTAGCTACAACGA ACTGGCAC 


2543 


956 


UUCCCAGA G UUUUGGUU 


216 


AACCAAAA GGCTAGCTACAACGA TCTGGGAA 


2544 


962 


GAGUUUUG G UUUUUGAA 


217 


TTCAAAAA GGCTAGCTACAACGA CAAAACTC 


2545 


970 


GUUUUUGA A CUAGCAAU 


218 


ATTGCTAG GGCTAGCTACAACGA TCAAAAAC 


2546 


974 


UUGAACUA G CAAUGCCU 


219 


AGGCATTG GGCTAGCTACAACGA TAGTTCAA 


2547 


977 


AACUAGCA A UGCCUGUG 


220 


CACAGGCA GGCTAGCTACAACGA TGCTAGTT 


2548 


979 


CUAGCAAU G CCUGUGAA 


221 


TTCACAGG GGCTAGCTACAACGA ATTGCTAG 


2549 


983 


CAAUGCCU G UGAAAAAG 


222 


CTTTTTCA GGCTAGCTACAACGA AGGCATTG 


2550 


994 


AAAAAGAA A CUGAAUAC 


223 


GTATTCAG GGCTAGCTACAACGA TTCTTTTT 


2551 


999 


GAAACUGA A UACCUAAG 


224 


CTTAGGTA GGCTAGCTACAACGA TCAGTTTC 


2552 


1001 


AACUGAAU A CCUAAGAU 


225 


ATCTTAGG GGCTAGCTACAACGA ATTCAGTT 


2553 


1008 


UACCUAAG A UUUCUGUC 


226 


GACAGAAA GGCTAGCTACAACGA CTTAGGTA 


2554 


1014 


AGAUUUCU G UCUUGGGG 


227 


CCCCAAGA GGCTAGCTACAACGA AGAAATCT 


2555 


1022 


GUCUUGGG G UUUUUGGU 


228 


ACCAAAAA GGCTAGCTACAACGA CCCAAGAC 


2556 


1029 


GGUUUUUG G UGCAUGCA 


229 


TGCATGCA GGCTAGCTACAACGA CAAAAACC 


2557 


1031 


UUUUUGGU G CAUGCAGU 


230 


ACTGCATG GGCTAGCTACAACGA ACCAAAAA 


2558 


1033 


UUUGGUGC A UGCAGUUG 


231 


CAACTGCA GGCTAGCTACAACGA GCACCAAA 


2559 


1035 


UGGUGCAU G CAGUUGAU 


232 


ATCAACTG GGCTAGCTACAACGA ATGCACCA 


2560 


1038 


UGCAUGCA G UUGAUUAC 


233 


GTAATCAA GGCTAGCTACAACGA TGCATGCA 


2561 


1042 


UGCAGUUG A UUACUUCU 


234 


AGAAGTAA GGCTAGCTACAACGA CAACTGCA 


2562 


1045 


AGUUGAUU A CUUCUUAU 


235 


ATAAGAAG GGCTAGCTACAACGA AATCAACT 


2563 


1052 


UACUUCUU A UUUUUCUU 


236 


AAGAAAAA GGCTAGCTACAACGA AAGAAGTA 


2564 


1061 


UUUUUCUU A CCAAGUGU 


237 


ACACTTGG GGCTAGCTACAACGA AAGAAAAA 


2565 


1066 


CUUACCAA G UGUGAAUG 


238 


CATTCACA GGCTAGCTACAACGA TTGGTAAG 


2566 


1068 


UACCAAGU G UGAAUGUU 


239 


AACATTCA GGCTAGCTACAACGA ACTTGGTA 


2567 


1072 


AAGUGUGA A UGUUGGUG 


240 


CACCAACA GGCTAGCTACAACGA TCACACTT 


2568 


1074 


GUGUGAAU G UUGGUGUG 


241 


CACACCAA GGCTAGCTACAACGA ATTCACAC 


2569 


1078 


GAAUGUUG G UGUGAAAC 


242 


GTTTCACA GGCTAGCTACAACGA CAACATTC 


2570 


1080 


AUGUUGGU G UGAAACAA 


243 


TTGTTTCA GGCTAGCTACAACGA ACCAACAT 


2571 


1085 


GGUGUGAA A CAAAUUAA 


244 


TTAATTTG GGCTAGCTACAACGA TTCACACC 


2572 


1089 


UGAAACAA A UUAAUGAA 


245 


TTCATTAA GGCTAGCTACAACGA TTGTTTCA 


2573 


1093 


ACAAAUUA A UGAAGCUU 


246 


AAGCTTCA GGCTAGCTACAACGA TAATTTGT 


2574 


1098 


UUAAUGAA G CUUUUGAA 


247 


TTCAAAAG GGCTAGCTACAACGA TTCATTAA 


2575 


1106 


GCUUUUGA A UCAUCCCU 


248 


AGGGATGA GGCTAGCTACAACGA TCAAAAGC 


2576 | 


1109 


UUUGAAUC A UCCCUAUU 


249 


AATAGGGA GGCTAGCTACAACGA GATTCAAA 


2577 


1115 


UCAUCCCU A UUCUGUGU 


250 


ACACAGAA GGCTAGCTACAACGA AGGGATGA 


2578 


1120 


CCUAUUCU G UGUUUUAU 


251 


ATAAAACA GGCTAGCTACAACGA AGAATAGG 


2579 


1122 


UAUUCUGU G UUUUAUCU 


252 


AGATAAAA GGCTAGCTACAACGA ACAGAATA 


2580 


1127 


UGUGUUUU A UCUAGUCA 


253 


TGACTAGA GGCTAGCTACAACGA AAAACACA 


2581 


1132 


UUUAUCUA G UCACAUAA 


254 


TTATGTGA GGCTAGCTACAACGA TAGATAAA 


2582 


1135 


AUCUAGUC A CAUAAAUG 


255 


CATTTATG GGCTAGCTACAACGA GACTAGAT 


2583 


1137 


CUAGUCAC A UAAAUGGA 


256 


TCCATTTA GGCTAGCTACAACGA GTGACTAG 


2584 ; 



wo 



02/097114 



PCT/US02/16840 



90 



1141 


UCACAUAA A UGGAUUAA 


257 


TTAATCCA GGCTAGCTACAACGA TTATGTGA 


2585 


1145 


AUAAAUGG A UUAAUUAC 


258 


GTAATTAA GGCTAGCTACAACGA CCATTTAT 


2586 


1149 


AUGGAUUA A UUACUAAU 


259 


ATTAGTAA GGCTAGCTACAACGA TAATCCAT * 


2587 


1152 


GAUUAAUU A CUAAUUUC 


260 


GAAATTAG GGCTAGCTACAACGA AATTAATC 


2588 


1156 


TV *jk YTT /"TT TTV *TV T TT Tt T/*1T\ /TT TT T 

AAUUACUA A UUUCAGUU 


261 


AACTGAAA GGCTAGCTACAACGA TAGTAATT 


2589 


1162 


TTTV TV TTTTTY/*1> Y TT TAT TV /T TV AT fT 

UAAUUUCA G UUGAGACC 


262 


GGTCTCAA GGCTAGCTACAACGA TGAAATTA 


2590 


1168 


CAGUUGAG A CCUUCUAA 


263 


TTAGAAGG GGCTAGCTACAACGA CTCAACTG 


2591 


1176 


ACCUUCUA A UUGGUUUU 


264 


AAAACCAA GGCTAGCTACAACGA TAGAAGGT 


2592 


1180 


UCUAAUUG G UUUUUACU 


265 


AGTAAAAA GGCTAGCTACAACGA CAATTAGA 


2593 


1186 


UGGUUUUU A CUGAAACA 


266 


TGTTTCAG GGCTAGCTACAACGA AAAAACCA 


2594 


1192 


UUACUGAA A CAUUGAGG 


267 


CCTCAATG GGCTAGCTACAACGA TTCAGTAA 


2595 


1194 


ACUGAAAC A UUGAGGGA 


268 


TCCCTCAA GGCTAGCTACAACGA GTTTCAGT 


2596 


1202 


AUUGAGGG A CACAAAUU 


269 


AATTTGTG GGCTAGCTACAACGA CCCTCAAT 


2597 


1204 


UGAGGGAC A CAAAUUUA 


270 


TAAATTTG GGCTAGCTACAACGA GTCCCTCA 


2598 


1208 


GGACACAA A UUUAUGGG 


271 


CCCATAAA GGCTAGCTACAACGA TTGTGTCC 


2599 


1212 


ACAAAUUU A UGGGCUUC 


272 


GAAGCCCA GGCTAGCTACAACGA AAATTTGT 


2600 


1216 


AUUUAUGG G CUUCCUGA 


273 


TCAGGAAG GGCTAGCTACAACGA CCATAAAT 


2601 


1224 


GCUUCCUG A UGAUGAUXJ 


274 


AATCATCA GGCTAGCTACAACGA CAGGAAGC 


2602 


1227 


UCCUGAUG A UGAUUCUU 


275 


AAGAATCA GGCTAGCTACAACGA CATCAGGA 


2603 


1230 


UGAUGAUG A UUCUUCUA 


276 


TAGAAGAA GGCTAGCTACAACGA CATCATCA 


2604 


1240 


UCUUCUAG G CAUCAUGU 


277 


ACATGATG GGCTAGCTACAACGA CTAGAAGA 


2605 


1242 


UUCUAGGC A UCAUGUCC 


278 


GGACATGA GGCTAGCTACAACGA GCCTAGAA 


2606 


1245 


UAGGCAUC A UGUCCUAU 


279 


ATAGGACA GGCTAGCTACAACGA GATGCCTA 


2607 


1247 


GGCAUCAU G UCCUAUAG 


280 


CTATAGGA GGCTAGCTACAACGA ATGATGCC 


2608 


1252 


/*1Ti 1 T/^TT A/^t Y T\ t YTl /TYTTTTTOTT 

CAUGUCCU A UAGUUUGU 


281 


ACAAACTA GGCTAGCTACAACGA AGGACATG 


2609 


1255 


/•-tt TnriT TT\ T TTV f% t TT TT T^IY TOT\ t Y 

GUCCUAUA G UUUGUCAU 


282 


ATGACAAA GGCTAGCTACAACGA TATAGGAC 


2610 


1259 


UAUAGUUU G UCAUCCCU 


283 


AGGGATGA GGCTAGCTACAACGA AAACTATA 


2611 


1262 


AGUUUGUC A UCCCUGAU 


284 


ATCAGGGA GGCTAGCTACAACGA GACAAACT 


2612 


1269 


CAUCCCUG A UGAAUGUA 


285 


TACATTCA GGCTAGCTACAACGA CAGGGATG 


2613 


1273 


CCUGAUGA A UGUAAAGU 


286 


ACTTTACA GGCTAGCTACAACGA TCATCAGG 


2614 


1275 


UGAUGAAU G UAAAGUUA 


287 


TAACTTTA GGCTAGCTACAACGA ATTCATCA 


2615 


1280 


AAUGUAAA G UUACACUG 


288 


CAGTGTAA GGCTAGCTACAACGA TTTACATT 


2616 


1283 


GUAAAGUU A CACUGUUC 


289 


GAACAGTG GGCTAGCTACAACGA AACTTTAC 


2617 


1285 


AAAGUUAC A CUGUUCAC 


290 


GTGAACAG GGCTAGCTACAACGA GTAACTTT 


2618 


1288 


GUUACACU G UUCACAAA 


291 


TTTGTGAA GGCTAGCTACAACGA AGTGTAAC 


2619 


1292 


CACUGUUC A CAAAGGUU 


292 


AACCTTTG GGCTAGCTACAACGA GAACAGTG 


2620 


1298 


UCACAAAG G UUUUGUCU 


293 


AGACAAAA GGCTAGCTACAACGA CTTTGTGA 


2621 


1303 


AAGGUUUU G UCUCCUUU 


294 


AAAGGAGA GGCTAGCTACAACGA AAAACCTT 


2622 


1314 


UCCUUVCC A CUGCVAUU 


295 


AATAGCAG GGCTAGCTACAACGA GGAAAGGA 


2623 


1317 


UUUCCACU G CUAUUAGU 


296 


ACTAATAG GGCTAGCTACAACGA AGTGGAAA 


2624 


1320 


CCACUGCU A UUAGUCAU 


297 


ATGACTAA GGCTAGCTACAACGA AGCAGTGG 


2625 


1324 


UGCUAUUA G UCAUGGUC 


298 


GACCATGA GGCTAGCTACAACGA TAATAGCA 


2626 


1327 


UAUUAGUC A UGGUCACU 


299 


AGTGACCA GGCTAGCTACAACGA GACTAATA 


2627 


1330 


UAGUCAUG G UCACUCUC 


300 


GAGAGTGA GGCTAGCTACAACGA CATGACTA 


2628 


1333 


UCAUGGUC A CUCUCCCC 


301 


GGGGAGAG GGCTAGCTACAACGA GACCATGA 


2629 


1345 


UCCCCAAA A UAUUAUAU 


302 


ATATAATA GGCTAGCTACAACGA TTTGGGGA 


2630 


1347 


CCCAAAAU A UUAUAUUU 


303 


AAATATAA GGCTAGCTACAACGA ATTTTGGG 


2631 


1350 


AAAAUAUU A UAUUUUUU 


304 


AAAAAATA GGCTAGCTACAACGA AATATTTT 


2632 


1352 


AAUAUUAU A UUUUUUCU 


305 


AGAAAAAA GGCTAGCTACAACGA ATAATATT 


2633 


1361 


UUUUUUCU A UAAAAAGA 


306 


• TCTTTTTA GGCTAGCTACAACGA AGAAAAAA 


2634 


1375 


AGAAAAAA A UGGAAAAA 


307 


TTTTTCCA GGCTAGCTACAACGA TTTTTTCT 


2635 


1385 


GGAAAAAA A UUACAAGG 


308 


CCTTGTAA GGCTAGCTACAACGA TTTTTTCC 


2636 
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1388 


AAAAAAUU A CAAGGCAA 


309 


TTGCCTTG GG CTAG CTACAACG A AATTTTTT 


2637 


1393 


AUUACAAG G CAAUGGAA 


310 


TTCCATTG GG CTAG CTACAACG A CTTGTAAT 


2638 


1396 


ACAAGGCA A UGGAAACU 


311 


AGTTTCCA GG CTAG CTACAACG A TGCCTTGT 


2639 


1402 


CAAUGGAA A CUAUUAUA 


312 


TATAATAG GG CTAG CTACAACGA TTCCATTG 


2640 | 


1405 


UGGAAACU A UUAUAAGG 


313 


CCTTATAA GG CTAG CTACAACGA AGTTTCCA 


2641 


1408 


AAACUAUU A UAAGGCCA 


314 


TGGCCTTA GG CTAG CTACAACGA AATAGTTT 


2642 


1413 


AUUAUAAG G CCAUUUCC 


315 


GGAAATGG GGCTAGCTACAACGA CTTATAAT 


2643 


1416 


AUAAGGCC A UUUCCUUU 


316 


AAAGGAAA GGCTAGCTACAACGA GGCCTTAT 


2644 


1427 


UCCUUUUC A CAUUAGAU 


317 


ATCTAATG GGCTAGCTACAACGA GAAAAGGA 


2645 


1429 


CUUUUCAC A UUAGAUAA 


318 


TTATCTAA GGCTAGCTACAACGA GTGAAAAG 


2646 


1434 


CACAUUAG A UAAAUUAC 


319 


GTAATTTA GGCTAGCTACAACGA CTAATGTG 


2647 


1438 


UUAGAUAA A UUACUAUA 


320 


TATAGTAA GGCTAGCTACAACGA TTATCTAA 


2648 


1441 


GAUAAAUU A CUAUAAAG 


321 


CTTTATAG GGCTAGCTACAACGA AATTTATC 


2649 


1444 


AAAUUACU A UAAAGACU 


322 


AGTCTTTA GGCTAGCTACAACGA AGTAATTT 


2650 


1450 


CUAUAAAG A CUCCUAAU 


323 


ATTAGGAG GGCTAGCTACAACGA CTTTATAG 


2651 


1457 


GACUCCUA A UAGCUUUU 


324 


AAAAGCTA GGCTAGCTACAACGA TAGGAGTC 


2652 


1460 


UCCUAAUA G CUUUUUCC 


325 


GGAAAAAG GGCTAGCTACAACGA TATTAGGA 


2653 


1470 


UUUUUCCU G UUAAGGCA 


326 


TGCCTTAA GGCTAGCTACAACGA AGGAAAAA 


2654 


1476 


CUGUUAAG G CAGACCCA 


327 


TGGGTCTG GGCTAGCTACAACGA CTTAACAG 


2655 


1480 


UAAGGCAG A CCCAGUAU 


328 


ATACTGGG GGCTAGCTACAACGA CTGCCTTA 


2656 


1485 


CAGACCCA G UAUGAAUG 


329 


CATTCATA GGCTAGCTACAACGA TGGGTCTG 


2657 


1487 


GACCCAGU A UGAAUGGG 


330 


CCCATTCA GGCTAGCTACAACGA ACTGGGTC 


2658 


1491 


CAGUAUGA A UGGGAUUA 


331 


TAATCCCA GGCTAGCTACAACGA TCATACTG 


2659 


1496 


UGAAUGGG A UUAUUAUA 


332 


TATAATAA GGCTAGCTACAACGA CCCATTCA 


2660 1 


1499 


AUGGGAUU A UUAUAGCA 


333 


TGCTATAA GGCTAGCTACAACGA AATCCCAT 


2661 


1502 


GGAUUAUU A UAGCAACC 


334 


GGTTGCTA GGCTAGCTACAACGA AATAATCC 


2662 


1505 


UUAUUAUA G CAACCAUU 


335 


AATGGTTG GGCTAGCTACAACGA TATAATAA 


2663 


1508 


UUAUAGCA A CCAUUUUG 


336 


CAAAATGG GGCTAGCTACAACGA TGCTATAA 


2664 


1511 


UAGCAACC A UUUUGGGG 


337 


CCCCAAAA GGCTAGCTACAACGA GGTTGCTA 


2665 


1519 


AUUUUGGG G CUAUAUUU 


338 


AAATATAG GGCTAGCTACAACGA CCCAAAAT 


2666 


1522 


UUGGGGCU A UAUUUACA 


339 


TGTAAATA GGCTAGCTACAACGA AGCCCCAA 


2667 


1524 


GGGGCUAU A UUUACAUG 


340 


CATGTAAA GGCTAGCTACAACGA ATAGCCCC 


2666 


1528 


CUAUAUUU A CAUGCUAC 


341 


GTAGCATG GGCTAGCTACAACGA AAATATAG 


2669 


1530 


AUAUUUAC A UGCUACUA 


342 


TAGTAGCA GGCTAGCTACAACGA GTAAATAT 


2670 


1532 


AUUUACAU G CUACUAAA 


343 


TTTAGTAG GGCTAGCTACAACGA ATGTAAAT 


2671 


1535 


UACAUGCU A CUAAAUUU 


344 


AAATTTAG GGCTAGCTACAACGA AGCATGTA 


2672 


1540 


GCUACUAA A UUUUUAUA 


345 


TATAAAAA GGCTAGCTACAACGA TTAGTAGC 


2673 


1546 


AAAUUUUU A UAAUAAUU 


346 


AATTATTA GGCTAGCTACAACGA AAAAATTT 


2674 


1549 


UUUUUAUA A UAAUUGAA 


347 


TTCAATTA GGCTAGCTACAACGA TATAAAAA 


2675 


1552 


UUAUAAUA A UUGAAAAG 


348 


CTTTTCAA GGCTAGCTACAACGA TATTATAA 


2676 


1561 


UUGAAAAG A UUUUAACA 


349 


TGTTAAAA GGCTAGCTACAACGA CTTTTCAA 


2677 


1567 


AGAUUUUA A CAAGUAUA 


350 


TATACTTG GGCTAGCTACAACGA TAAAATCT 


2678 


1571 


UUUAACAA G UAUAAAAA 


351 


TTTTTATA GGCTAGCTACAACGA TTGTTAAA 


2679 


1573 


UAACAAGU A UAAAAAAA 


352 


TTTTTTTA GGCTAGCTACAACGA ACTTGTTA 


2680 


1581 


AUAAAAAA A UUCUCAUA 


353 


TATGAGAA GGCTAGCTACAACGA TTTTTTAT 


2681 


1587 


AAAUUCUC A UAGGAAUU 


354 


AATTCCTA GGCTAGCTACAACGA GAGAATTT 


2682 


1593 


UCAUAGGA A UUAAAUGU 


355 


ACATTTAA GGCTAGCTACAACGA TCCTATGA 


2683 


1598 


GGAAUUAA A UGUAGUCU 


356 


AGACTACA GGCTAGCTACAACGA TTAATTCC 


2684 ! 


1600 


AAUUAAAU G UAGUCUCC 


357 


GGAGACTA GGCTAGCTACAACGA ATTTAATT 


2685 


1603 


UAAAUGUA G UCUCCCUG 


358 


CAGGGAGA GGCTAGCTACAACGA TACATTTA 


2686 


1611 


GUCUCCCU G UGUCAGAC 


359 


GTCTGACA GGCTAGCTACAACGA AGGGAGAC 


2687 


1613 


CUCCCUGU G UCAGACUG 


360 


CAGTCTGA GGCTAGCTACAACGA ACAGGGAG 


2688 
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1618 


UGUGUCAG A CUGCUCUU 


361 


AAGAGCAG GGCTAGCTACAACGA CTGACACA 


2689 


1621 


GUCAGACU G CUCUUUCA 


362 


TGAAAGAG GGCTAGCTACAACGA AGTCTGAC 


2690 


1629 


GCUCUUUC A UAGUAUAA 


363 


TTATACTA GGCTAGCTACAACGA GAAAGAGC 


2691 


1632 


CUUUCAUA G UAUAACUU 


364 


AAGTTATA GGCTAGCTACAACGA TATGAAAG 


2692 


1634 


UUCAUAGU A UAACUUUA 


365 


TAAAGTTA GGCTAGCTACAACGA ACTATGAA 


2693 


1637 


AUAGUAUA A CUUUAAAU 


366 


ATTTAAAG GGCTAGCTACAACGA TATACTAT 


2694 


1644 


AACUUUAA A UCUUUUCU 


367 


AGAAAAGA GGCTAGCTACAACGA TTAAAGTT 


2695 


1656 


UUUCUUCA A CUUGAGUC 


368 


GACTCAAG GGCTAGCTACAACGA TGAAGAAA 


2696 


1662 


CAACUUGA G UCUUUGAA 


369 


TTCAAAGA GGCTAGCTACAACGA TCAAGTTG 


2697 


1672 


CUUUGAAG A UAGUUUUA 


370 


TAAAACTA GGCTAGCTACAACGA CTTCAAAG 


2698 


1675 


UGAAGAUA G UUUUAAUU 


371 


AATTAAAA GGCTAGCTACAACGA TATCTTCA 


2699 


1681 


UAGUUUUA A UUCUGCUU 


372 


AAGCAGAA GGCTAGCTACAACGA TAAAACTA 


2700 


1686 


UUAAUUCU G CUUGUGAC 


373 


GTCACAAG GGCTAGCTACAACGA AGAATTAA 


2701 


1690 


UUCUGCUU G UGACAUUA 


374 


TAATGTCA GGCTAGCTACAACGA AAGCAGAA 


2702 


1693 


UGCUUGUG A CAUUAAAA 


375 


TTTTAATG GGCTAGCTACAACGA CACAAGCA 


2703 


1695 


CUUGUGAC A UUAAAAGA 


376 


TCTTTTAA GGCTAGCTACAACGA GTCACAAG 


2704 


1703 


AUUAAAAG A UUAUUUGG 


377 


CCAAATAA GGCTAGCTACAACGA CTTTTAAT 


2705 


1706 


AAAAGAUU A UUUGGGCC 


378 


GGCCCAAA GGCTAGCTACAACGA AATCTTTT 


2706 


1712 


UUAUUUGG G CCAGUUAU 


379 


ATAACTGG GGCTAGCTACAACGA CCAAATAA 


2707 


1716 


UUGGGCCA G UUAUAGCU 


380 


AGCTATAA GGCTAGCTACAACGA TGGCCCAA 


2708 


1719 


GGCCAGUU A UAGCUUAU 


381 


ATAAGCTA GGCTAGCTACAACGA AACTGGCC 


2709 


1722 


CAGUUAUA G CUUAUUAG 


382 


CTAATAAG GGCTAGCTACAACGA TATAACTG 


2710 


1726 


UAUAGCUU A UUAGGUGU 


383 


ACACCTAA GGCTAGCTACAACGA AAGCTATA 


2711 


1731 


CUUAUUAG G UGUUGAAG 


384 


CTTCAACA GGCTAGCTACAACGA CTAATAAG 


2712 


1733 


UAUUAGGU G UUGAAGAG 


385 


CTCTTCAA GGCTAGCTACAACGA ACCTAATA 


2713 


1742 


UUGAAGAG A CCAAGGUU 


386 


AACCTTGG GGCTAGCTACAACGA CTCTTCAA 


2714 


1748 


AGACCAAG G UUGCAAGC 


387 


GCTTGCAA GG CTAGCTACAACGA CTTGGTCT 


2715 


1751 


CCAAGGUU G CAAGCCAG 


388 


CTGGCTTG GGCTAGCTACAACGA AACCTTGG 


2716 


1755 


GGUUGCAA G CCAGGCCC 


389 


GGGCCTGG GGCTAGCTACAACGA TTGCAACC 


2717 


1760 


CAAGCCAG G CCCUGUGU 


390 


ACACAGGG GGCTAGCTACAACGA CTGGCTTG 


2718 


1765 


CAGGCCCU G UGUGAACC 


391 


GGTTCACA GGCTAGCTACAACGA AGGGCCTG 


2719 


1767 


GGCCCUGU G UGAACCUU 


392 


AAGGTTCA GGCTAGCTACAACGA ACAGGGCC 


2720 


1771 


CUGUGUGA A CCUUGAGC 


393 


GCTCAAGG GGCTAGCTACAACGA TCACACAG 


2721 


1778 


AACCUUGA G CUUUCAUA 


394 


TATGAAAG GGCTAGCTACAACGA TCAAGGTT 


2722 


1784 


GAGCUUUC A UAGAGAGU 


395 


ACTCTCTA GGCTAGCTACAACGA GAAAGCTC 


2723 


1791 


CAUAGAGA G UUUCACAG 


396 


CTGTGAAA GGCTAGCTACAACGA TCTCTATG 


2724 


1796 


AGAGUUUC A CAGCAUGG 


397 


CCATGCTG GGCTAGCTACAACGA GAAACTCT 


2725 


1799 


GUUUCACA G CAUGGACU 


398 


AGTCCATG GGCTAGCTACAACGA TGTGAAAC 


2726 


1801 


UUCACAGC A UGGACUGU 


399 


ACAGTCCA GGCTAGCTACAACGA GCTGTGAA 


2727 


1805 


CAGCAUGG A CUGUGUGC 


400 


GCACACAG GGCTAGCTACAACGA CCATGCTG 


2728 


1808 


CAUGGACU G UGUGCCCC 


401 


GGGGCACA GGCTAGCTACAACGA AGTCCATG 


2729 


1810 


UGGACUGU G UGCCCCAC 


402 


GTGGGGCA GGCTAGCTACAACGA ACAGTCCA 


2730 


1812 


GACUGUGU G CCCCACGG 


403 


CCGTGGGG GGCTAGCTACAACGA ACACAGTC 


2731 


1817 


UGUGCCCC A CGGUCAUC 


404 


GATGACCG GGCTAGCTACAACGA GGGGCACA 


2732 


1820 


GCCCCACG G UCAUCCGA 


405 


TCGGATGA GGCTAGCTACAACGA CGTGGGGC 


2733 


1823 


CCACGGUC A UCCGAGUG 


406 


CACTCGGA GGCTAGCTACAACGA GACCGTGG 


2734 


1829 


UCAUCCGA G UGGUUGUA 


407 


TACAACCA GGCTAGCTACAACGA TCGGATGA 


2735 


1832 


UCCGAGUG G UUGUACGA 


408 


TCGTACAA GGCTAGCTACAACGA CACTCGGA 


2736 


1835 


GAGUGGUU G UACGAUGC 


409 


GCATCGTA GGCTAGCTACAACGA AACCACTC 


2737 


1837 


GUGGUUGU A CGAUGCAU 


410 


ATGCATCG GGCTAGCTACAACGA ACAACCAC 


2738 


1840 


GUUGUACG A UGCAUUGG 


411 


CCAATGCA GGCTAGCTACAACGA CGTACAAC 


2739 


1842 


UGUACGAU G CAUUGGUU 


412 


AACCAATG GGCTAGCTACAACGA ATCGTACA 


2740 
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1844 


UACGAUGC A UUGGUUAG 


413 


CTAACCAA GGCTAGCTACAACGA GCATCGTA 


2741 


1B48 


AUGCAUUG G UUAGUCAA 


414 


TTGACTAA GGCTAGCTACAACGA CAATGCAT 


2742 


1852 


AUUGGUUA G UCAAAAAU 


415 


ATTTTTGA GGCTAGCTACAACGA TAACCAAT 


2743 


18 59 


AGUCAAAA A UGGGGAGG 


416 


CCTCCCCA GGCTAGCTACAACGA TTTTGACT 


2744 


1869 


GGGGAGGG A CUAGGGCA 


417 


TGCCCTAG GGCTAGCTACAACGA CCCTCCCC 


2745 


1875 


GGACUAGG G CAGUUUGG 


418 


CCAAACTG GGCTAGCTACAACGA CCTAGTCC 


2746 


1878 


CUAGGGCA G UUUGGAUA 


419 


TATCCAAA GGCTAGCTACAACGA TGCCCTAG 


2747 


1884 


CAGUUUGG A UAGCUCAA 


420 


TTGAGCTA GGCTAGCTACAACGA CCAAACTG 


2748 


1887 


UUUGGAUA G CUCAACAA 


421 


TTGTTGAG GGCTAGCTACAACGA TATCCAAA 


2749 


1892 


AUAGCUCA A CAAGAUAC 


422 


GTATCTTG GGCTAGCTACAACGA TGAGCTAT 


2750 


1897 


f tn'R T\ /*1TV TV /~% yt IT* /~i TV TV T T/1T T 

UCAACAAG A UACAAUCU 


423 


AGATTGTA GGCTAGCTACAACGA CTTGTTGA 


2751 


1899 


AACAAGAU A CAAUCUCA 


424 


TGAGATTG GGCTAGCTACAACGA ATCTTGTT 


2752 


1902 


AAGAUACA A UCUCACUC 


425 


GAGTGAGA GGCTAGCTACAACGA TGTATCTT 


2753 


1907 


ACAAUCUC A CUCUGUGG 


426 


CCACAGAG GGCTAGCTACAACGA GAGATTGT 


2754 


1912 


CUCACUCU G UGGUGGUC 


427 


GACCACCA GGCTAGCTACAACGA AGAGTGAG 


2755 


1915 


ACUCUGUG G UGGUCCUG 


428 


CAGGACCA GGCTAGCTACAACGA CACAGAGT 


2756 


1918 


CUGUGGUG G UCCUGCUG 


429 


CAGCAGGA GGCTAGCTACAACGA CACCACAG 


2757 


1923 


GUGGUCCU G CUGACAAA 


430 


TTTGTCAG GGCTAGCTACAACGA AGGACCAC 


2758 


1927 


UCCUGCUG A CAAAUCAA 


431 


TTGATTTG GGCTAGCTACAACGA CAGCAGGA 


2759 


1931 


GCUGACAA A UCAAGAGC 


432 


GCTCTTGA GGCTAGCTACAACGA TTGTCAGC 


2760 


1938 


AAUCAAGA G CAUUGCUU 


433 


AAGCAATG GGCTAGCTACAACGA TCTTGATT 


2761 


1940 


UCAAGAGC A UUGCUUUU 


434 


AAAAGCAA GGCTAGCTACAACGA GCTCTTGA 


2762 


1943 


AGAGCAUU G CUUUUGUU 


435 


AACAAAAG GGCTAGCTACAACGA AATGCTCT 


2763 


1949 


UUGCUUUU G UUUCUUAA 


436 


TTAAGAAA GGCTAGCTACAACGA AAAAGCAA 


2764 


1962 


T1TT7V TV /"I 71 TV TV T\ mi Tl » nTTnn 

UUAAGAAA A CAAACUCU 


437 


AGAGTTTG GGCTAGCTACAACGA TTTCTTAA 


2765 


1966 


GAAAACAA A CUCUUUUU 


438 


AAAAAGAG GGCTAGCTACAACGA TTGTTTTC 


2766 


1980 


UUUUAAAA A UUACUUUU 


439 


AAAAGTAA GGCTAGCTACAACGA TTTTAAAA 


2767 


1983 


UAAAAAUU A CUUUUAAA 


440 


TTTAAAAG GGCTAGCTACAACGA AATTTTTA 


2768 


1991 


ACUUUUAA A UAUUAACU 


441 


AGTTAATA GGCTAGCTACAACGA TTAAAAGT 


2769 


1993 


UUUUAAAU A UUAACUCA 


442 


TGAGTTAA GGCTAGCTACAACGA ATTTAAAA 


2770 


1997 


AAAUAUUA A CUCAAAAG 


443 


CTTTTGAG GGCTAGCTACAACGA TAATATTT 


2771 


2005 


ACUCAAAA G UUGAGAUU 


444 


AATCTCAA GGCTAGCTACAACGA TTTTGAGT 


2772 


2011 


AAGUUGAG A UUUUGGGG 


445 


CCCCAAAA GGCTAGCTACAACGA CTCAACTT 


2773 


2019 


AUUUUGGG G UGGUGGUG 


446 


CACCACCA GGCTAGCTACAACGA CCCAAAAT 


2774 


2022 


UUGGGGUG G UGGUGUGC 


447 


GCACACCA GGCTAGCTACAACGA CACCCCAA 


2775 


2025 


GGGUGGUG G UGUGCCAA 


448 


TTGGCACA GGCTAGCTACAACGA CACCACCC 


2776 


2027 


GUGGUGGU G UGCCAAGA 


449 


TCTTGGCA GGCTAGCTACAACGA ACCACCAC 


2777 


2029 


GGUGGUGU G CCAAGACA 


450 


TGTCTTGG GGCTAGCTACAACGA ACACCACC 


2778 


2035 


GUGCCAAG A CAUUAAUU 


451 


AATTAATG GGCTAGCTACAACGA CTTGGCAC 


2779 


2037 


GCCAAGAC A UUAAUUUU 


452 


AAAATTAA GGCTAGCTACAACGA GTCTTGGC 


2780 


2041 


AGACAUUA A UUUUUUUU 


453 


AAAAAAAA GGCTAGCTACAACGA TAATGTCT 


2781 


2054 


UUUUUUAA A CAAUGAAG 


454 


CTTCATTG GGCTAGCTACAACGA TTAAAAAA 


2782 


2 057 


T TT TT TTV T\ Tl /-*T\ 7\ T TV T\ /TTin TV 

UUUAAACA A UGAAGUGA 


455 


TCACTTCA GGCTAGCTACAACGA TGTTTAAA 


2783 


2062 


ACAAUGAA G UGAAAAAG 


456 


CTTTTTCA GGCTAGCTACAACGA TTCATTGT 


2784 


2070 


GUGAAAAA G UUUUACAA 


457 


TTGTAAAA GGCTAGCTACAACGA TTTTTCAC 


2785 


2075 


AAAGUUUU A CAAUCUCU 


458 


AGAGATTG GGCTAGCTACAACGA AAAACTTT 


2786 


2078 


GUUUUACA A UCUCUAGG 


459 


CCTAGAGA GGCTAGCTACAACGA TGTAAAAC 


2787 


2086 


AUCUCUAG G UUUGGCUA 


460 


TAGCCAAA GGCTAGCTACAACGA CTAGAGAT 


2788 


2091 


UAGGUUUG G CUAGUUCU 


461 


AGAACTAG GGCTAGCTACAACGA CAAACCTA 


2789 


2095 


UUUGGCUA G UUCUCUUA 


462 


TAAGAGAA GGCTAGCTACAACGA TAGCCAAA 


2790 


2104 


UUCUCUUA A CACUGGUU 


463 


AACCAGTG GGCTAGCTACAACGA TAAGAGAA 


2791 


2106 


CUCUUAAC A CUGGUUAA 


464 


TTAACCAG GGCTAGCTACAACGA GTTAAGAG 


2792 
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2110 


UAACACUG G UUAAAUUA 


465 


TAATTTAA GGCTAGCTACAACGA CAGTGTTA 


2793 


2115 


CUGGUUAA A UUAACAUU 


466 


AATGTTAA GGCTAGCTACAACGA TTAACCAG 


2794 


2119 


UUAAAUUA A CAUUGCAU 


467 


ATGCAATG GGCTAGCTACAACGA TAATTTAA 


2795 


2121 


AAAUUAAC A UUGCAUAA 


468 


TTATGCAA GGCTAGCTACAACGA GTTAATTT 


2796 


2124 


UUAACAUU G CAUAAACA 


469 


TGTTTATG GGCTAGCTACAACGA AATGTTAA 


2797 


2126 


AACAUUGC A UAAACACU 


470 


AGTGTTTA GGCTAGCTACAACGA GCAATGTT 


2798 


2130 


UUGCAUAA A CACUUUUC 


471 


GAAAAGTG GGCTAGCTACAACGA TTATGCAA 


2799 


2132 


GCAUAAAC A CUUUUCAA 


472 


TTGAAAAG GGCTAGCTACAACGA GTTTATGC 


2800 


2141 


CUUUUCAA G UCUGAUCC 


473 


GGATCAGA GGCTAGCTACAACGA TTGAAAAG 


2801 


2146 


CAAGUCUG A UCCAUAUU 


474 


AATATGGA GGCTAGCTACAACGA CAGACTTG 


2802 


2150 


UCUGAUCC A UAUUUAAU 


475 


ATTAAATA GGCTAGCTACAACGA GGATCAGA 


2803 


2152 


UGAUCCAU A UUUAAUAA 


476 


TTATTAAA GGCTAGCTACAACGA ATGGATCA 


2804 


2157 


CAUAUUUA A UAAUGCUU 


477 


AAGCATTA GGCTAGCTACAACGA TAAATATG 


2805 


2160 


AUUUAAUA A UGCUUUAA 


478 


TTAAAGCA GGCTAGCTACAACGA TATTAAAT 


2806 


2162 


UUAAUAAU G CUUUAAAA 


479 


TTTTAAAG GGCTAGCTACAACGA ATTATTAA 


2807 


2170 


GCUUUAAA A UAAAAAUA 


480 


TATTTTTA GGCTAGCTACAACGA TTTAAAGC 


2808 


2176 


AAAUAAAA A UAAAAACA 


481 


TGTTTTTA GGCTAGCTACAACGA TTTTATTT 


2809 


2182 


AAAUAAAA A CAAUCCUU 


482 


AAGGATTG GGCTAGCTACAACGA TTTTATTT 


2810 


2185 


UAAAAACA A UCCUUUUG 


483 


CAAAAGGA GGCTAGCTACAACGA TGTTTTTA 


2811 


2194 


UCCUUUUG A UAAAUUUA 


484 


TAAATTTA GGCTAGCTACAACGA CAAAAGGA 


2812 


2198 


UUUGAUAA A UUUAAAAU 


485 


ATTTTAAA GGCTAGCTACAACGA TTATCAAA 


2813 


2205 


AAUUUAAA A UGUUACUU 


486 


AAGTAACA GGCTAGCTACAACGA TTTAAATT 


2814 


2207 


UUUAAAAU G UUACUUAU 


487 


ATAAGTAA GGCTAGCTACAACGA ATTTTAAA 


2815 


2210 


AAAAUGUU A CUUAUUUU 


488 


AAAATAAG GGCTAGCTACAACGA AACATTTT 


2816 


2214 


UGUUACUU A UUUUAAAA 


489 


TTTTAAAA GGCTAGCTACAACGA AAGTAACA 


2817 


2222 


AUUUUAAA A UAAAUGAA 


490 


TTCATTTA GGCTAGCTACAACGA TTTAAAAT 


2818 


2226 


UAAAAUAA A UGAAGUGA 


491 


TCACTTCA GGCTAGCTACAACGA TTATTTTA 


' 2819 


2231 


UAAAUGAA G UGAGAUGG 


492 


CCATCTCA GGCTAGCTACAACGA TTCATTTA 


! 2820 


2236 


GAAGUGAG A UGGCAUGG 


493 


CCATGCCA GGCTAGCTACAACGA CTCACTTC 


2821 


2239 


GUGAGAUG G CAUGGUGA 


494 


TCACCATG GGCTAGCTACAACGA CATCTCAC 


2822 


2241 


GAGAUGGC A UGGUGAGG 


495 


CCTCACCA GGCTAGCTACAACGA GCCATCTC 


2823 


2244 


AUGGCAUG G UGAGGUGA 


496 


TCACCTCA GGCTAGCTACAACGA CATGCCAT 


2824 


2249 


AUGGUGAG G UGAAAGUA 


497 


TACTTTCA GGCTAGCTACAACGA CTCACCAT 


2825 


2255 


AGGUGAAA G UAUCACUG 


498 


CAGTGATA GGCTAGCTACAACGA TTTCACCT 


2826 


2257 


GUGAAAGU A UCACUGGA 


499 


TCCAGTGA GGCTAGCTACAACGA ACTTTCAC 


2827 


2260 


AAAGUAUC A CUGGACUA 


500 


TAGTCCAG GGCTAGCTACAACGA GATACTTT 


2828 


2265 


AUCACUGG A CUAGGUUG 


501 


CAACCTAG GGCTAGCTACAACGA CCAGTGAT 


2829 


2270 


UGGACUAG G UUGUUGGU 


502 


ACCAACAA GGCTAGCTACAACGA CTAGTCCA 


2830 


2273 


ACUAGGUU G UUGGUGAC 


503 


GTCACCAA GGCTAGCTACAACGA AACCTAGT 


2831 


2277 


GGUUGUUG G UGACUUAG 


504 


CTAAGTCA GGCTAGCTACAACGA CAACAACC 


2832 


2280 


UGUUGGUG A CUUAGGUU 


505 


AACCTAAG GGCTAGCTACAACGA CACCAACA 


2833 


2286 


UGACUUAG G UUCUAGAU 


506 


ATCTAGAA GGCTAGCTACAACGA CTAAGTCA 


2834 


2293 


GGUUCUAG A UAGGUGUC 


507 


GACACCTA GGCTAGCTACAACGA CTAGAACC 


2835 


2297 


CUAGAUAG G UGUCUUUU 


508 


AAAAGACA GGCTAGCTACAACGA CTATCTAG 


2836 


2299 


AGAUAGGU G UCUUUUAG 


509 


CTAAAAGA GGCTAGCTACAACGA ACCTATCT 


2837 


2309 


CUUUUAGG A CUCUGAUU 


510 


AATCAGAG GGCTAGCTACAACGA CCTAAAAG 


2838 


2315 


GGACUCUG A UUUUGAGG 


511 


CCTCAAAA GGCTAGCTACAACGA CAGAGTCC 


2839 


2324 


UUUUGAGG A CAUCACUU 


512 


AAGTGATG GGCTAGCTACAACGA CCTCAAAA 


2840 


2326 


UUGAGGAC A UCACUUAC 


513 


GTAAGTGA GGCTAGCTACAACGA GTCCTCAA 


2841 


2329 


AGGACAUC A CUUACUAU 


514 


ATAGTAAG GGCTAGCTACAACGA GATGTCCT 


2842 


2333 


CAUCACUU A CUAUCCAU 


515 


ATGGATAG GGCTAGCTACAACGA AAGTGATG 


2843 


2336 


CACUUACU A UCCAUUUC 


516 


GAAATGGA GGCTAGCTACAACGA AGTAAGTG 


2844 
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2340 


UACUAUCC A UUUCUUCA 


517 


TGAAGAAA GGCTAGCTACAACGA GGATAGTA 


2845 


2348 


AUUUCUUC A UGUUAAAA 


518 


TTTTAACA GGCTAGCTACAACGA GAAGAAAT 


2846 


2350 


UUCUUCAU G UUAAAAGA 


519 


TCTTTTAA GGCTAGCTACAACGA ATGAAGAA 


2847 


2360 


UAAAAGAA G UCAUCUCA 


520 


TGAGATGA GGCTAGCTACAACGA TTCTTTTA 


2848 


2363 


AAGAAGUC A UCUCAAAC 


521 


GTTTGAGA GGCTAGCTACAACGA GACTTCTT 


2849 


2370 


CAUCUCAA A CUCUUAGU 


522 


ACTAAGAG GGCTAGCTACAACGA TTGAGATG 


2850 


2377 


AACUCUUA G UUUUUUUU 


523 


AAAAAAAA GGCTAGCTACAACGA TAAGAGTT 


2851 


2390 


UUUUUUUU A CACUAUGU 


524 


ACATAGTG GGCTAGCTACAACGA AAAAAAAA 


2852 


2392 


UUUUUUAC A CUAUGUGA 


525 


TCACATAG GGCTAGCTACAACGA GTAAAAAA 


2853 


2395 


UUUACACU A UGUGAUUU 


526 


AAATCACA GGCTAGCTACAACGA AGTGTAAA 


2854 


2397 


UACACUAU G UGAUUUAU 


527 


ATAAATCA GGCTAGCTACAACGA ATAGTGTA 


2855 


2400 


ACUAUGUG A UUUAUAUU 


528 


AATATAAA GGCTAGCTACAACGA CACATAGT 


2856 


2404 


UGUGAUUU A UAUUCCAU 


529 


ATGGAATA GGCTAGCTACAACGA AAATCACA 


2857 


2406 


UGAUUUAU A UUCCAUUU 


530 


AAATGGAA GGCTAGCTACAACGA ATAAATCA 


2858 


2411 


UAUAUUCC A UUUACAUA 


531 


TATGTAAA GGCTAGCTACAACGA GGAATATA 


2859 


2415 


UUCCAUUU A CAUAAGGA 


532 


TCCTTATG GGCTAGCTACAACGA AAATGGAA 


2860 


2417 


CCAUUUAC A UAAGGAUA 


533 


TATCCTTA GGCTAGCTACAACGA GTAAATGG 


2861 


2423 


ACAUAAGG A UACACUUA 


534 


TAAGTGTA GGCTAGCTACAACGA CCTTATGT 


2862 


2425 


AUAAGGAU A CACUUAUU 


535 


AATAAGTG GGCTAGCTACAACGA ATCCTTAT 


2863 


2427 


AAGGAUAC A CUUAUUUG 


536 


r CAAATAAG GGCTAGCTACAACGA GTATCCTT 


2864 


2431 


AUACACUU A UUUGUCAA 


537 


TTGACAAA GGCTAGCTACAACGA AAGTGTAT 


2865 


2435 


ACUUAUUU G UCAAGCUC 


538 


GAGCTTGA GGCTAGCTACAACGA AAATAAGT 


2866 


2440 


UUUGUCAA G CUCAGCAC 


539 


GTGCTGAG GGCTAGCTACAACGA TTGACAAA 


2867 


2445 


CAAGCUCA G CACAAUCU 


540 


AGATTGTG GGCTAGCTACAACGA TGAGCTTG 


2868 


2447 


AGCUCAGC A CAAUCUGU 


541 


ACAGATTG GGCTAGCTACAACGA GCTGAGCT 


2869 


2450 


UCAGCACA A UCUGUAAA 


542 


TTTACAGA GGCTAGCTACAACGA TGTGCTGA 


2870 


2454 


CACAAUCU G UAAAUUUU 


543 


AAAATTTA GGCTAGCTACAACGA AGATTGTG 


2871 


2458 


AUCUGUAA A UUUUUAAC 


544 


GTTAAAAA GGCTAGCTACAACGA TTACAGAT 


2872 


2465 


AAUUUUUA A CCUAUGUU 


545 


AACATAGG GGCTAGCTACAACGA TAAAAATT 


2873 


2469 


UUUAACCU A UGUUACAC 


546 


GTGTAACA GGCTAGCTACAACGA AGGTTAAA 


2874 


2471 


UAACCUAU G UUACACCA 


547 


TGGTGTAA GGCTAGCTACAACGA ATAGGTTA 


2875 


2474 


CCUAUGUU A CACCAUCU 


548 


AGATGGTG GGCTAGCTACAACGA AACATAGG 


2876 


2476 


UAUGUUAC A CCAUCUUC 


549 


GAAGATGG GGCTAGCTACAACGA GTAACATA 


2877 


2479 


GUUACACC A UCUUCAGU 


550 


ACTGAAGA GGCTAGCTACAACGA GGTGTAAC 


2878 


2486 


CAUCUUCA G UGCCAGUC 


551 


GACTGGCA GGCTAGCTACAACGA TGAAGATG 


2879 


2488 


UCUUCAGU G CCAGUCUU 


552 


AAGACTGG GGCTAGCTACAACGA ACTGAAGA 


2880 


2492 


CAGUGCCA G UCUUGGGC 


553 


GCCCAAGA GGCTAGCTACAACGA TGGCACTG 


2881 


2499 


AGUCUUGG G CAAAAUUG 


554 


CAATTTTG GGCTAGCTACAACGA CCAAGACT 


2882 


2504 


UGGGCAAA A UUGUGCAA 


555 


TTGCACAA GGCTAGCTACAACGA TTTGCCCA 


2883 


2507 


GCAAAAUU G UGCAAGAG 


556 


CTCTTGCA GGCTAGCTACAACGA AATTTTGC 


2884 


2509 


AAAAUUGU G CAAGAGGU 


557 


ACCTCTTG GGCTAGCTACAACGA ACAATTTT 


2885 


2516 


UGCAAGAG G UGAAGUUU 


558 


AAACTTCA GGCTAGCTACAACGA CTCTTGCA 


2886 


2521 


GAGGUGAA G UUUAUAUU 


559 


AATATAAA GGCTAGCTACAACGA TTCACCTC 


2887 


2525 


UGAAGUUU A UAUUUGAA 


560 


TTCAAATA GGCTAGCTACAACGA AAACTTCA 


2888 


2527 


AAGUUUAU A UUUGAAUA 


561 


TATTCAAA GGCTAGCTACAACGA ATAAACTT 


2889 


2533 


AUAUUUGA A UAUCCAUU 


562 


AATGGATA GGCTAGCTACAACGA TCAAATAT 


2890 


2535 


AUUUGAAU A UCCAUUCU 


563 


AGAATGGA GGCTAGCTACAACGA ATTCAAAT 


2891 


2539 


GAAUAUCC A UUCUCGUU 


564 


AACGAGAA GGCTAGCTACAACGA GGATATTC 


2892 


2545 


CCAUUCUC G UUUUAGGA 


565 


TCCTAAAA GGCTAGCTACAACGA GAGAATGG 


2893 


2553 


GUUUUAGG A CUCUUCUU 


566 


AAGAAGAG GGCTAGCTACAACGA CCTAAAAC 


2894 


2564 


CUUCUUCC A UAUUAGUG 


567 


CACTAATA GGCTAGCTACAACGA GGAAGAAG 


2895 


2566 


UCUUCCAU A UUAGUGUC 


568 


GACACTAA GGCTAGCTACAACGA ATGGAAGA 


2896 
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2570 


CCAUAUUA G UGUCAUCU 


569 


AGATGACA GGCTAG CTACAACG A TAATATGG 


2897 


2572 


AUAUUAGU G UCAUCUUG 


570 


CAAGATGA GGCTAGCTACAACGA ACTAATAT 


2898 


2575 


UUAGUGUC A UCUUGCCU 


571 


AGGCAAGA GGCTAGCTACAACGA GACACTAA 


2899 


2580 


GUCAUCUU G CCUCCCUA 


572 


TAGGGAGG GGCTAGCTACAACGA AAGATGAC 


2900 


2588 


GCCUCCCU A CCUUCCAC 


573 


GTGGAAGG GGCTAGCTACAACGA AGGGAGGC 


2901 


2595 


UACCUUCC A CAUGCCCC 


574 


GGGGCATG GGCTAGCTACAACGA GGAAGGTA 


2902 


2597 


CCUUCCAC A UGCCCCAU 


575 


ATGGGGCA GGCTAGCTACAACGA GTGGAAGG 


2903 


2599 


UUCCACAU G CCCCAUGA 


576 


TCATGGGG GGCTAGCTACAACGA ATGTGGAA 


2904 


'2604 


CAUGCCCC A UGACUUGA 


577 


TCAAGTCA GGCTAGCTACAACGA GGGGCATG 


2905 


2607 


GCCCCAUG A CUUGAUGC 


578 


GCATCAAG GGCTAGCTACAACGA CATGGGGC 


2906 


2612 


AUGACUUG A UGCAGUUU 


579 


AAACTGCA GGCTAGCTACAACGA CAAGTCAT 


2907 


2614 


GACUUGAU G CAGUUUUA 


580 


TAAAACTG GGCTAGCTACAACGA ATCAAGTC 


2908 


2617 


UUGAUGCA G UUUUAAUA 


581 


TATTAAAA GGCTAGCTACAACGA TGCATCAA 


2909 


2623 


CAGUUUUA A UACUUGUA 


582 


TACAAGTA GGCTAGCTACAACGA TAAAACTG 


2910 


2625 


GUUUUAAU A CUUGUAAU 


583 


ATTACAAG GGCTAGCTACAACGA ATTAAAAC 


i 2911 


2629 


UAAUACUU G UAAUUCCC 


584 


GGGAATTA GGCTAGCTACAACGA AAGTATTA 


2912 


2632 


UACUUGUA A UUCCCCUA 


585 


TAGGGGAA GGCTAGCTACAACGA TACAAGTA 


2913 


2641 


UUCCCCUA A CCAUAAGA 


586 


TCTTATGG GGCTAGCTACAACGA TAGGGGAA 


2914 


2644 


CCCUAACC A UAAGAUUU 


587 


AAATCTTA GGCTAGCTACAACGA GGTTAGGG 


2915 


2649 


ACCAUAAG A UUUACUGC 


588 


GCAGTAAA GGCTAGCTACAACGA CTTATGGT 


2916 


2653 


UAAGAUUU A CUGCUGCU 


589 


AGCAGCAG GGCTAGCTACAACGA AAATCTTA 


2917 


2656 


GAUUUACU G CUGCUGUG 


590 


CACAGCAG GGCTAGCTACAACGA AGTAAATC 


2918 


2659 


UUACUGCU G CUGUGGAU 


591 


ATCCACAG GGCTAGCTACAACGA AGCAGTAA 


2919 


2662 


CUGCUGCU G UGGAUAUC 


592 


GATATCCA GGCTAGCTACAACGA AGCAGCAG 


2920 


2666 


UGCUGUGG A UAUCUCCA 


593 


TGGAGATA GGCTAGCTACAACGA CCACAGCA 


2921 


2668 


CUGUGGAU A UCUCCAUG 


594 


CATGGAGA GGCTAGCTACAACGA ATCCACAG 


2922 


2674 


AUAUCUCC A UGAAGUUU 


595 


AAACTTCA GGCTAGCTACAACGA GGAGATAT 


2923 


2679 


UCCAUGAA G UUUUCCCA 


596 


TGGGAAAA GGCTAGCTACAACGA TTCATGGA 


2924 


2687 


GUUUUCCC A CUGAGUCA 


597 


TGACTCAG GGCTAGCTACAACGA GGGAAAAC 


2925 


2692 


CCCACUGA G UCACAUCA 


598 


TGATGTGA GGCTAGCTACAACGA TCAGTGGG 


2926 


2695 


ACUGAGUC A CAUCAGAA 


599 


TTCTGATG GGCTAGCTACAACGA GACTCAGT 


2927 


2697 


UGAGUCAC A UCAGAAAU 


600 


ATTTCTGA GGCTAGCTACAACGA GTGACTCA 


2928 


2704 


CAUCAGAA A UGCCCUAC 


601 


GTAGGGCA GGCTAGCTACAACGA TTCTGATG 


2929 


2706 


UCAGAAAU G CCCUACAU 


602 


ATGTAGGG GGCTAGCTACAACGA ATTTCTGA 


2930 


2711 


AAUGCCCU A CAUCUUAU 


603 


ATAAGATG GGCTAGCTACAACGA AGGGCATT 


2931 


2713 


UGCCCUAC A UCUUAUUU 


604 


AAATAAGA GGCTAGCTACAACGA GTAGGGCA 


2932 


2718 


UACAUCUU A UUUUCCUC 


605 


GAGGAAAA GGCTAGCTACAACGA AAGATGTA 


2933 


2730 


UCCUCAGG G CUCAAGAG 


606 


CTCTTGAG GGCTAGCTACAACGA CCTGAGGA 


2934 


2740 


UCAAGAGA A UCUGACAG 


607 


CTGTCAGA GGCTAGCTACAACGA TCTCTTGA | 


2935 


2745 


AGAAUCUG A CAGAUACC 


608 


GGTATCTG GGCTAGCTACAACGA CAGATTCT ! 


2936 


2749 


UCUGACAG A UACCAUAA 


609 


TTATGGTA GGCTAGCTACAACGA CTGTCAGA 


2937 


2751 


UGACAGAU A CCAUAAAG 


610 


CTTTATGG GGCTAGCTACAACGA ATCTGTCA 


2938 


2754 


CAGAUACC A UAAAGGGA 


611 


TCCCTTTA GGCTAGCTACAACGA GGTATCTG 


2939 


2762 


AUAAAGGG A UUUGACCU 


612 


AGGTCAAA GGCTAGCTACAACGA CCCTTTAT 


2940 


2767 


GGGAUUUG A CCUAAUCA 


613 


TGATTAGG GGCTAGCTACAACGA CAAATCCC 


2941 


2772 


UUGACCUA A UCACUAAU 


614 


ATTAGTGA GGCTAGCTACAACGA TAGGTCAA 


2942 


2775 


ACCUAAUC A CUAAUUUU 


615 


AAAATTAG GGCTAGCTACAACGA GATTAGGT 


2943 


2779 


AAUCACUA A UUUUCAGG 


616 


CCTGAAAA GGCTAGCTACAACGA TAGTGATT 


2944 


2787 


AUUUUCAG G UGGUGGCU 


617 


AGCCACCA GGCTAGCTACAACGA CTGAAAAT 


2945 


2790 


UUCAGGUG G UGGCUGAU 


618 


ATCAGCCA GGCTAGCTACAACGA CACCTGAA 


2946 


2793 


AGGUGGUG G CUGAUGCU 


619 


AG CATC AG GGCTAGCTACAACGA CACCACCT 


2947 


2797 


GGUGGCUG A UGCUUUGA 


620 


TCAAAGCA GGCTAGCTACAACGA CAGCCACC 


2948 
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2799 


UGGCUGAU G CUUUGAAC 


621 


GTTCAAAG GGCTAGCTACAACGA ATCAGCCA 


2949 


2806 


UGCUUUGA A CAUCUCUU 


622 


AAGAGATG GGCTAGCTACAACGA TCAAAGCA 


2950 


2808 


CUUUGAAC A UCUCUUUG 


623 


CAAAGAGA GGCTAGCTACAACGA GTTCAAAG 


2951 


2816 


AUCUCUUU G CUGCCCAA 


624 


TTGGGCAG GGCTAGCTACAACGA AAAGAGAT 


2952 


2819 


UCUUUGCU G CCCAAUCC 


625 


GGATTGGG GGCTAGCTACAACGA AGCAAAGA 


2953 


2824 


GCUGCCCA A UCCAUUAG 


626 


CTAATGGA GGCTAGCTACAACGA TGGGCAGC 


2954 


2828 


CCCAAUCC A UUAGCGAC 


627 


GTCGCTAA GGCTAGCTACAACGA GGATTGGG 


2955 


2832 


AUCCAUUA G CGACAGUA 


628 


TACTGTCG GGCTAGCTACAACGA TAATGGAT 


2956 


2835 


CAUUAGCG A CAGUAGGA 


629 


TCCTACTG GGCTAGCTACAACGA CGCTAATG 


2957 


2838 . 


UAGCGACA G UAGGAUUU 


630 


AAATCCTA GGCTAGCTACAACGA TGTCGCTA 


2958 


2843 


ACAGUAGG A UUUUUCAA 


631 


TTGAAAAA GGCTAGCTACAACGA CCTACTGT 


2959 


2851 


AUUUUUCA A CCCUGGUA 


632 


TACCAGGG GGCTAGCTACAACGA TGAAAAAT 


2960 


2857 


CAACCCUG G UAUGAAUA 


633 


TATTCATA GGCTAGCTACAACGA CAGGGTTG 


2961 


2859 


ACCCUGGU A UGAAUAGA 


634 


TCTATTCA GGCTAGCTACAACGA ACCAGGGT 


2962 


2863 


UGGUAUGA A UAGACAGA 


635 


TCTGTCTA GGCTAGCTACAACGA TCATACCA 


2963 


2867 


AUGAAUAG A CAGAACCC 


636 


GGGTTCTG GGCTAGCTACAACGA CTATTCAT 


2964 


2872 


UAGACAGA A CCCUAUCC 


637 


GGATAGGG GGCTAGCTACAACGA TCTGTCTA 


2965 


2877 


AGAACCCU A UCCAGUGG 


638 


CCACTGGA GGCTAGCTACAACGA AGGGTTCT 


2966 


2882 


CCUAUCCA G UGGAAGGA 


639 


TCCTTCCA GGCTAGCTACAACGA TGGATAGG 


2967 


2893 


GAAGGAGA A UUUAAUAA 


640 


TTATTAAA GGCTAGCTACAACGA TCTCCTTC 


2968 


2898 


AGAAUUUA A UAAAGAUA 


641 


TATCTTTA GGCTAGCTACAACGA TAAATTCT 


2969 


2904 


UAAUAAAG A UAGUGCAG 


642 


CTGCACTA GGCTAGCTACAACGA CTTTATTA 


2970 


2907 


UAAAGAUA G UGCAGAAA 


643 


TTTCTGCA GGCTAGCTACAACGA TATCTTTA 


2971 


2909 


AAGAUAGU G CAGAAAGA 


644 


TCTTTCTG GGCTAGCTACAACGA ACTATCTT 


2972 


2918 


CAGAAAGA A UUCCUUAG 


645 


CTAAGGAA GGCTAGCTACAACGA TCTTTCTG 


2973 


2927 


UUCCUUAG G UAAUCUAU 


646 


ATAGATTA GGCTAGCTACAACGA CTAAGGAA 


2974 


2930 


CUUAGGUA A UCUAUAAC 


647 


GTTATAGA GGCTAGCTACAACGA TACCTAAG 


2975 


2934 


GGUAAUCU A UAACUAGG 


648 


CCTAGTTA GGCTAGCTACAACGA AG ATT AC C 


2976 


2937 


AAUCUAUA A CUAGGACU 


649 


AGTCCTAG GGCTAGCTACAACGA TATAGATT 


2977 


2943 


UAACUAGG A CUACUCCU 


650 


AGGAGTAG GGCTAGCTACAACGA CCTAGTTA 


2978 


2946 


CUAGGACU A CUCCUGGU 


651 


ACCAGGAG GGCTAGCTACAACGA AGTCCTAG 


2979 


2953 


UACUCCUG G UAACAGUA 


652 


TACTGTTA GGCTAGCTACAACGA CAGGAGTA 


2980 


2956 


UCCUGGUA A CAGUAAUA 


653 


TATTACTG GGCTAGCTACAACGA TACCAGGA 


2981 


2959 


UGGUAACA G UAAUACAU 


654 


ATGTATTA GGCTAGCTACAACGA TGTTACCA 


2982 


2962 


UAACAGUA A UACAUUCC 


655 


GGAATGTA GGCTAGCTACAACGA TACTGTTA 


2983 


2964 


ACAGUAAU A.CAUUCCAU 


656 


ATGGAATG GGCTAGCTACAACGA ATTACTGT 


2984 


2966 


AGUAAUAC A UUCCAUUG 


657 


CAATGGAA GGCTAGCTACAACGA GTATTACT 


2985 


2971 


UACAUUCC A UUGUUUUA 


658 


TAAAACAA GGCTAGCTACAACGA GGAATGTA 


2986 


2974 


AUUCCAUU G UUUUAGUA 


659 


TACTAAAA GGCTAGCTACAACGA AATGGAAT 


2987 


2980 


UUGUUUUA G UAACCAGA 


660 


TCTGGTTA GGCTAGCTACAACGA TAAAACAA 


2988 


2983 


UUUUAGUA A CCAGAAAU 


661 


ATTTCTGG GGCTAGCTACAACGA TACTAAAA 


2989 


2990 


AACCAGAA A UCUUCAUG 


662 


CATGAAGA GGCTAGCTACAACGA TTCTGGTT 


2990 


2996 


AAAUCUUC A UGCAAUGA 


663 


TCATTGCA GGCTAGCTACAACGA GAAGATTT 


2991 


2998 


AUCUUCAU G CAAUGAAA 


664 


TTTCATTG GGCTAGCTACAACGA ATGAAGAT 


2992 


3001 


UUCAUGCA A UGAAAAAU 


665 


ATTTTTCA GGCTAGCTACAACGA TGCATGAA 


2993 


3008 


AAUGAAAA A UACUUUAA 


666 


TTAAAGTA GGCTAGCTACAACGA TTTTCATT 


2994 I 


3010 


UGAAAAAU A CUUUAAUU 


667 


AATTAAAG GGCTAGCTACAACGA ATTTTTCA 


2995 


3016 


AUACUUUA A UUCAUGAA 


668 


TTCATGAA GGCTAGCTACAACGA TAAAGTAT 


2996 


3020 


UUUAAUUC A UGAAGCUU 


669 


AAGCTTCA GGCTAGCTACAACGA GAATTAAA 


2997 


3025 


UUCAUGAA G CUUACUUU 


670 


AAAGTAAG GGCTAGCTACAACGA TTCATGAA 


2998 


3029 


UGAAGCUU A CUUUUUUU 


671 


AAAAAAAG GGCTAGCTACAACGA AAGCTTCA 


2999 


3044 


UUUUUUUG G UGUCAGAG 


672 


CTCTGACA GGCTAGCTACAACGA CAAAAAAA 


3000 
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3046 


UUUUUGGU G UCAGAGUC 


673 


GACTCTGA GGCTAGCTACAACGA ACCAAAAA 


3001 


3052 


GUGUCAGA G UCUCGCUC 


674 


GAGCGAGA GGCTAGCTACAACGA TCTGACAC 


3002 


3057 


AGAGUCUC G CUCUUGUC 


675 


GACAAGAG GGCTAGCTACAACGA GAGACTCT 


3003 


3063 


UCGCUCUU G UCACCCAG 


676 


CTGGGTGA GGCTAGCTACAACGA AAGAGCGA 


3004 


3066 


CUCUUGUC A CCCAGGCU 


677 


AGCCTGGG GGCTAGCTACAACGA GACAAGAG 


3005 


3072 


UCACCCAG G CUGGAAUG 


678 


CATTCCAG GGCTAGCTACAACGA CTGGGTGA 


3006 


3078 


AGGCUGGA A UGCAGUGG 


679 


CCACTGCA GGCTAGCTACAACGA TCCAGCCT 


3007 


3080 


GCUGGAAU G CAGUGGCG 


680 


CGCCACTG GGCTAGCTACAACGA ATTCCAGC 


3008 


3083 


GGAAUGCA G UGGCGCCA 


681 


TGGCGCCA GGCTAGCTACAACGA TGCATTCC 


3009 


3086 


AUGCAGUG G CGCCAUCU 


682 


AGATGGCG GGCTAGCTACAACGA CACTGCAT 


3010 


3088 


GCAGUGGC G CCAUCUCA 


683 


TGAGATGG GGCTAGCTACAACGA GCCACTGC 


3011 


3091 


GUGGCGCC A UCUCAGCU 


684 


AGCTGAGA GGCTAGCTACAACGA GGCGCCAC 


3012 


3097 


CCAUCUCA G CUCACUGC 


685 


GCAGTGAG GGCTAGCTACAACGA TGAGATGG 


3013 


3101 


CUCAGCUC A CUGCAACC 


686 


GGTTGCAG GGCTAGCTACAACGA GAGCTGAG 


3014 


3104 


AGCUCACU G CAACCUUC 


687 


GAAGGTTG GGCTAGCTACAACGA AGTGAGCT 


3015 


3107 


UCACUGCA A CCUUCCAU 


688 


ATGGAAGG GGCTAGCTACAACGA TGCAGTGA 


3016 


3114 


AACCUUCC A UCUUCCCA 


689 


TGGGAAGA GGCTAGCTACAACGA GGAAGGTT 


3017 


3124 


CUUCCCAG G UUCAAGCG 


690 


CGCTTGAA GGCTAGCTACAACGA CTGGGAAG 


3018 


3130 


AGGUUCAA G CGAUUCUC 


691 


GAGAATCG GGCTAGCTACAACGA TTGAACCT 


3019 


3133 


UUCAAGCG A UUCUCGUG 


692 


CACGAGAA GGCTAGCTACAACGA CGCTTGAA 


3020 


3139 


CGAUUCUC G UGCCUCGG 


693 


CCGAGGCA GGCTAGCTACAACGA GAGAATCG 


3021 


3141 


AUUCUCGU G CCUCGGCC 


694 


GGCCGAGG GGCTAGCTACAACGA ACGAGAAT 


3022 


3147 


GUGCCUCG G CCUCCUGA 


695 


TCAGGAGG GGCTAGCTACAACGA CGAGGCAC 


3023 


3156 


CCUCCUGA G UAGCUGGG 


696 


CCCAGCTA GGCTAGCTACAACGA TCAGGAGG 


3024 


3159 


CCUGAGUA G CUGGGAUU 


697 


AATCCCAG GGCTAGCTACAACGA TACTCAGG 


3025 


3165 


UAGCUGGG A UUACAGGC 


698 


GCCTGTAA GGCTAGCTACAACGA CCCAGCTA 


3026 


3168 


CUGGGAUU A CAGGCGUG 


699 


CACGCCTG GGCTAGCTACAACGA AATCCCAG 


3027 


3172 


GAUUACAG G CGUGUGCA 


700 


TGCACACG GGCTAGCTACAACGA CTGTAATC 


3028 


3174 


UUACAGGC G UGUGCACU 


701 


AGTGCACA GGCTAGCTACAACGA GCCTGTAA 


3029 


3176 


ACAGGCGU G UGCACUAC 


702 


GTAGTGCA GGCTAGCTACAACGA ACGCCTGT 


3030 


3178 


AGGCGUGU G CACUACAC 


703 


GTGTAGTG GGCTAGCTACAACGA ACACGCCT 


3031 


3180 


GCGUGUGC A CUACACUC 


704 


GAGTGTAG GGCTAGCTACAACGA GCACACGC 


3032 


3183 


UGUGCACU A CACUCAAC 


705 


GTTGAGTG GGCTAGCTACAACGA AGTGCACA 


3033 


3185 


UGCACUAC A CUCAACUA 


706 


TAGTTGAG GGCTAGCTACAACGA GTAGTGCA 


3034 


3190 


UACACUCA A CUAAUUUU 


707 


AAAATTAG GGCTAGCTACAACGA TGAGTGTA 


3035 


3194 


CUCAACUA A UUUUUGUA 


708 


TACAAAAA GGCTAGCTACAACGA TAGTTGAG 


3036 


3200 


UAAUUUUU G UAUUUUUA 


709 


TAAAAATA GGCTAGCTACAACGA AAAAATTA 


3037 


3202 


AUUUUUGU A UUUUUAGG 


710 


CCTAAAAA GGCTAGCTACAACGA ACAAAAAT 


3038 


3215 


UAGGAGAG A CGGGGUUU 


711 


AAACCCCG GGCTAGCTACAACGA CTCTCCTA 


3039 


3220 


GAGACGGG G UUUCACCU 


712 


AGGTGAAA GGCTAGCTACAACGA CCCGTCTC 


3040 


3225 


GGGGUUUC A CCUGUUGG 


713 


CCAACAGG GGCTAGCTACAACGA GAAACCCC 


3041 


3229 


UUUCACCU G UUGGCCAG 


714 


CTGGCCAA GGCTAGCTACAACGA AGGTGAAA 


3042 


3233 


ACCUGUUG G CCAGGCUG 


715 


CAGCCTGG GGCTAGCTACAACGA CAACAGGT 


3043 


3238 


UUGGCCAG G CUGGUCUC 


716 


GAGACCAG GGCTAGCTACAACGA CTGGCCAA 


3044 


3242 


CCAGGCUG G UCUCGAAC 


717 


GTTCGAGA GGCTAGCTACAACGA CAGCCTGG 


3045 


3249 


GGUCUCGA A CUCCUGAC 


718 


GTCAGGAG GGCTAGCTACAACGA TCGAGACC 


3046 


3256 


AACUCCUG A CCUCAAGU 


719 


ACTTGAGG GGCTAGCTACAACGA CAGGAGTT 


3047 


3263 


GACCUCAA G UGAUUCAC 


720 


GTGAATCA GGCTAGCTACAACGA TTGAGGTC 


3048 


3266 


CUCAAGUG A UUCACCCA 


721 


TGGGTGAA GGCTAGCTACAACGA CACTTGAG 


3049 


3270 


AGUGAUUC A CCCACCUU 


722 


AAGGTGGG GGCTAGCTACAACGA GAATCACT 


3050 


3274 


AUUCACCC A CCUUGGCC 


723 


GGCCAAGG GGCTAGCTACAACGA GGGTGAAT 


3051 


3280 


CCACCUUG G CCUCAUAA 


724 


TTATGAGG GGCTAGCTACAACGA CAAGGTGG 


3052 
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3285 


UUGGCCUC A UAAACCUG 


725 


CAGGTTTA GGCTAGCTACAACGA GAGGCCAA 


3053 


3289 


CCUCAUAA A CCUGUUUU 


726 


AAAACAGG GGCTAGCTACAACGA TTATGAGG 


3054 


3293 


AUAAACCU G UUUUGCAG 


727 


CTGCAAAA GGCTAGCTACAACGA AGGTTTAT 


3055 


3298 


CCUGUUUU G CAGAACUC 


728 


GAGTTCTG GGCTAGCTACAACGA AAAACAGG 


3056 


3303 


UUUGCAGA A CUCAUUUA 


729 


TAAATGAG GGCTAGCTACAACGA TCTGCAAA 


3057 


3307 


CAGAACUC A UUUAUUCA 


730 


TGAATAAA GGCTAGCTACAACGA GAGTTCTG 


3058 


3311 


ACUCAUUU A UUCAGCAA 


731 


TTGCTGAA GGCTAGCTACAACGA AAATGAGT 


3059 


3316 


UUUAUUCA G CAAAUAUU 


732 


AATATTTG GGCTAGCTACAACGA TGAATAAA 


3060 


3320 


UUCAGCAA A UAUUUAUU 


733 


AATAAATA GGCTAGCTACAACGA TTGCTGAA 


3061 


3322 


•CAGCAAAU A UUUAUUGA 


734 


TCAATAAA GGCTAGCTACAACGA ATTTGCTG 


3062 


3326 


AAAUAUUU A UUGAGUGC 


735 


GCACTCAA GGCTAGCTACAACGA AAATATTT 


3063 


3331 


UUUAUUGA G UGCCUACC 


736 


GGTAGGCA GGCTAGCTACAACGA TCAATAAA 


3064 


3333 


UAUUGAGU G CCUACCAG 


737 


CTGGTAGG GGCTAGCTACAACGA ACTCAATA 


3065 


3337 


GAGUGCCU A CCAGAUGC 


738 


GCATCTGG GGCTAGCTACAACGA AGGCACTC 


3066 


3342 


CCUACCAG A UGCCAGUC 


739 


GACTGGCA GGCTAGCTACAACGA CTGGTAGG 


3067 


3344 


UACCAGAU G CCAGUCAC 


740 


GTGACTGG GGCTAGCTACAACGA ATCTGGTA 


3068 


3348 


AG AUG CCA G UCACCGCA 


741 


TGCGGTGA GGCTAGCTACAACGA TGGCATCT 


3069 


3351 


UGCCAGUC A CCGCACAA 


742 


TTGTGCGG GGCTAGCTACAACGA GACTGGCA 


3070 


3354 


CAGUCACC G CACAAGGC 


743 


GCCTTGTG GGCTAGCTACAACGA GGTGACTG 


3071 


3356 


GUCACCGC A CAAGGCAC 


744 


GTGCCTTG GGCTAGCTACAACGA GCGGTGAC 


3072 


3361 


CGCACAAG G CACUGGGU 


745 


ACCCAGTG GGCTAGCTACAACGA CTTGTGCG 


3073 


3363 


CACAAGGC A CUGGGUAU 


746 


ATACCCAG GGCTAGCTACAACGA GCCTTGTG 


3074 


3368 


GGCACUGG G UAUAUGGU 


747 


ACCATATA GGCTAGCTACAACGA CCAGTGCC 


3075 


3370 


CACUGGGU A UAUGGUAU 


748 


ATACCATA GGCTAGCTACAACGA ACCCAGTG 


3076 


3372 


CUGGGUAU A UGGUAUCC 


749 


GGATACCA GGCTAGCTACAACGA ATACCCAG 


3077 


3375 


GGUAUAUG G UAUCCCCA 


750 


TGGGGATA GGCTAGCTACAACGA CATATACC 


3078 


3377 


UAUAUGGU A UCCCCAAA 


751 


TTTGGGGA GGCTAGCTACAACGA ACCATATA 


3079 


3385 


AUCCCCAA A CAAGAGAC 


752 


GTCTCTTG GGCTAGCTACAACGA TTGGGGAT 


3080 


3392 


AACAAGAG A CAUAAUCC 


753 


GGATTATG GGCTAGCTACAACGA CTCTTGTT 


3081 


3394 


CAAGAGAC A UAAUCCCG 


754 


CGGGATTA GGCTAGCTACAACGA GTCTCTTG 


3082 


3397 


GAGACAUA A UCCCGGUC 


755 


GACCGGGA GGCTAGCTACAACGA TATGTCTC 


3083 


3403 


UAAUCCCG G UCCUUAGG 


756 


CCTAAGGA GGCTAGCTACAACGA CGGGATTA 


3084 


3411 


GUCCUUAG G UACUGCUA 


757 


TAGCAGTA GGCTAGCTACAACGA CTAAGGAC 


3085 


3413 


CCUUAGGU A CUGCUAGU 


758 


ACTAGCAG GGCTAGCTACAACGA ACCTAAGG 


3086 


3416 


UAGGUACU G CUAGUGUG 


759 


CACACTAG GGCTAGCTACAACGA AGTACCTA 


3087 


3420 


UACUGCUA G UGUGGUCU 


760 


AGACCACA GGCTAGCTACAACGA TAGCAGTA 


3088 


3422 


CUGCUAGU G UGGUCUGU 


761 


ACAGACCA GGCTAGCTACAACGA ACTAGCAG 


3089 


3425 


CUAGUGUG G UCUGUAAU 


762 


ATTACAGA GGCTAGCTACAACGA CACACTAG 


3090 


3429 


UGUGGUCU G UAAUAUCU 


763 


AGATATTA GGCTAGCTACAACGA AGACCACA 


3091 


3432 


GGUCUGUA A UAUCUUAC 


764 


GTAAGATA GGCTAGCTACAACGA TACAGACC 


3092 


3434 


UCUGUAAU A UCUUACUA 


765 


TAGTAAGA GGCTAGCTACAACGA ATTACAGA 


3093 


3439 


AAUAUCUU A CUAAGGCC 


766 


GGCCTTAG GGCTAGCTACAACGA AAGATATT 


3094 


3445 


UUACUAAG G CCUUUGGU 


767 


ACCAAAGG GGCTAGCTACAACGA CTTAGTAA 


3095 


3452 


GGCCUUUG G UAUACGAC 


768 


GTCGTATA GGCTAGCTACAACGA CAAAGGCC 


3096 


3454 


CCUUUGGU A UACGACCC 


769 


GGGTCGTA GGCTAGCTACAACGA ACCAAAGG 


3097 


3456 


UUUGGUAU A CGACCCAG 


770 


CTGGGTCG GGCTAGCTACAACGA ATACCAAA 


3098 


3459 


GGUAUACG A CCCAGAGA 


771 


TCTCTGGG GGCTAGCTACAACGA CGTATACC 


3099 


3467 


ACCCAGAG A UAACACGA 


772 


TCGTGTTA GGCTAGCTACAACGA CTCTGGGT 


3100 


3470 


CAGAGAUA A CACGAUGC 


773 


GCATCGTG GGCTAGCTACAACGA TATCTCTG 


3101 


3472 


GAGAUAAC A CGAUGCGU 


774 


ACGCATCG GGCTAGCTACAACGA GTTATCTC 


3102 


3475 


AUAACACG A UGCGUAUU 


775 


AATACGCA GGCTAGCTACAACGA CGTGTTAT 


3103 


3477 


AACACGAU G CGUAUUUU 


776 


AAAATACG GGCTAGCTACAACGA ATCGTGTT 


3104 
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3479 


CACGAUGC G UAUUUUAG 


777 


CTAAAATA GGCTAGCTACAACGA GCATCGTG 


3105 


3481 


CGAUGCGU A UUUUAGUU 


778 


AACTAAAA GGCTAGCTACAACGA ACGCATCG 


3106 


3487 


GUAUUUUA G UUUUGCAA 


779 


TTGCAAAA GGCTAGCTACAACGA TAAAATAC 


3107 


3492 


UUAGUUUU G CAAAGAAG 


780 


CTTCTTTG GGCTAGCTACAACGA AAAACTAA 


3108 


3503 


AAGAAGGG G UUUGGUCU 


781 


AGACCAAA GGCTAGCTACAACGA CCCTTCTT 


3109 


3508 


GGGGUUUG G UCUCUGUG 


782 


CACAGAGA GGCTAGCTACAACGA CAAACCCC 


3110 


3514 


UGGUCUCU G UGCCAGCU 


783 


AGCTGGCA GGCTAGCTACAACGA AGAGACCA 


3111 


3516 


GUCUCUGU G CCAGCUCU 


784 


AGAGCTGG GGCTAGCTACAACGA ACAGAGAC 


3112 


3520 


CUGUGCCA G CUCUAUAA 


785 


TTATAGAG GGCTAGCTACAACGA TGGCACAG 


3113 


3525 


CCAGCUCU A UAAUUGUU 


786 


AACAATTA GGCTAGCTACAACGA AGAGCTGG 


3114 


3528 


GCUCUAUA A UUGUUUUG 


787 


CAAAACAA GGCTAGCTACAACGA TATAGAGC 


3115 


3531. 


CUAUAAUU G UUUUGCUA 


788 


TAGCAAAA GGCTAGCTACAACGA AATTATAG 


3116 


3536 


AUUGUUUU G CUACGAUU 


789 


AATCGTAG GGCTAGCTACAACGA AAAACAAT 


3117 


3539 


GUUUUGCU A CGAUUCCA 


790 


TGGAATCG GGCTAGCTACAACGA AGCAAAAC 


3118 


3542 


UUGCUACG A UUCCACUG 


791 


CAGTGGAA GGCTAGCTACAACGA CGTAGCAA 


3119 


3547 


ACGAUUCC A CUGAAACU 


792 


AGTTTCAG GGCTAGCTACAACGA GGAATCGT 


3120 


3553 


CCACUGAA A CUCUUCGA 


793 


TCGAAGAG GGCTAGCTACAACGA TTCAGTGG 


3121 


3561 


ACUCUUCG A UCAAGCUA 


794 


TAGCTTGA GGCTAGCTACAACGA CGAAGAGT 


3122 


3566 


UCGAUCAA G CUACUUUA 


795 


TAAAGTAG GGCTAGCTACAACGA TTGATCGA 


3123 


3569 


AUCAAGCU A CUUUAUGU 


796 


ACATAAAG GGCTAGCTACAACGA AGCTTGAT 


3124 


3574 


GCUACUUU A UGUAAAUC 


797 


GATTTACA GGCTAGCTACAACGA AAAGTAGC 


3125 


3576 


UACUUUAU G UAAAUCAC 


798 


GTGATTTA GGCTAGCTACAACGA ATAAAGTA 


3126 


3580 


UUAUGUAA A UCACUUCA 


799 


TGAAGTGA GGCTAGCTACAACGA TTACATAA 


3127 


3583 


UGUAAAUC A CUUCAUUG 


800 


CAATGAAG GGCTAGCTACAACGA GATTTACA 


3128 


3588 


AUCACUUC A UUGUUUUA 


801 


TAAAACAA GGCTAGCTACAACGA GAAGTGAT 


3129 


3591 


ACUUCAUU G UUUUAAAG 


802 


CTTTAAAA GGCTAGCTACAACGA AATGAAGT 


3130 


3602 


UUAAAGGA A UAAACUUG 


803 


CAAGTTTA GGCTAGCTACAACGA TCCTTTAA 


3131 


3606 


AGGAAUAA A CUUGAUUA 


804 


TAATCAAG GGCTAGCTACAACGA TTATTCCT 


3132 


3611 


UAAACUUG A UUAUAUUG 


805 


CAATATAA GGCTAGCTACAACGA CAAGTTTA 


3133 


3614 


ACUUGAUU A UAUUGUUU 


806 


AAACAATA GGCTAGCTACAACGA AATCAAGT 


3134 


3616 


UUGAUUAU A UUGUUUUU 


807 


AAAAACAA GGCTAGCTACAACGA ATAATCAA 


3135 


3619 


AUUAUAUU G UUUUUUUA 


808 


TAAAAAAA GGCTAGCTACAACGA AATATAAT 


3136 ! 


3627 


GUUUUUUU A UUUGGCAU 


809 


ATGCCAAA GGCTAGCTACAACGA AAAAAAAC 


3137 


3632 


UUUAUUUG G CAUAACUG 


810 


CAGTTATG GGCTAGCTACAACGA CAAATAAA 


3138 


3634 


UAUUUGGC A UAACUGUG 


811 


CACAGTTA GGCTAGCTACAACGA GCCAAATA 


3139 


3637 


UUGGCAUA A CUGUGAUU 


812 


AATCACAG GGCTAGCTACAACGA TATGCCAA 


3140 


3640 


GCAUAACU G UGAUUCUU 


813 


AAGAATCA GGCTAGCTACAACGA AGTTATGC 


3141 


3643 


UAACUGUG A UUCUUUUA 


814 


TAAAAGAA GGCTAGCTACAACGA CACAGTTA 


3142 


3654 


CUUUUAGG A CAAUUACU 


815 


AGTAATTG GGCTAGCTACAACGA CCTAAAAG 


3143 


3657 


UUAGGACA A UUACUGUA 


816 


TACAGTAA GGCTAGCTACAACGA TGTCCTAA 


3144 


3660 


GGACAAUU A CUGUACAC 


817 


GTGTACAG GGCTAGCTACAACGA AATTGTCC 


3145 


3663 


CAAUUACU G UACACAUU 


818 


AATGTGTA GGCTAGCTACAACGA AGTAATTG 


3146 


3665 


AUUACUGU A CACAUUAA 


819 


TTAATGTG GGCTAGCTACAACGA ACAGTAAT 


3147 


3667 


UACUGUAC A CAUUAAGG 


820 


CCTTAATG GGCTAGCTACAACGA GTACAGTA 


3148 


3669 


CUGUACAC A UUAAGGUG 


821 


CACCTTAA GGCTAGCTACAACGA GTGTACAG 


3149 


3675 


ACAUUAAG G UGUAUGUC 


822 


GACATACA GGCTAGCTACAACGA CTTAATGT 


3150 


3677 


AUUAAGGU G UAUGUCAG 


823 


CTGACATA GGCTAGCTACAACGA ACCTTAAT 


3151 


3679 


UAAGGUGU A UGUCAGAU 


824 


ATCTGACA GGCTAGCTACAACGA ACACCTTA 


3152 


3681 


AGGUGUAU G UCAGAUAU 


825 


ATATCTGA GGCTAGCTACAACGA ATACACCT 


3153 


3686 


UAUGUCAG A UAUUCAUA 


826 


TATGAATA GGCTAGCTACAACGA CTGACATA 


3154 


3688 


UGUCAGAU A UUCAUAUU 


827 


AATATGAA GGCTAGCTACAACGA ATCTGACA 


3155 


3692 


AGAUAUUC A UAUUGACC 


828 


GGTCAATA GGCTAGCTACAACGA GAATATCT 


3156 
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3694 


AUAUUCAU A UUGACCCA 


829 


TGGGTCAA GGCTAGCTACAACGA ATGAATAT 


3157 


3698 


UCAUAUUG A CCCAAAUG 


830 


CATTTGGG GGCTAGCTACAACGA CAATATGA 


3158 


3704 


UGACCCAA A UGUGUAAU 


831 


ATTACACA GGCTAGCTACAACGA TTGGGTCA 


3159 


3706 


ACCCAAAU G UGUAAUAU 


832 


ATATTACA GGCTAGCTACAACGA ATTTGGGT 


3160 


3708 


CCAAAUGU G UAAUAUUC 


833 


GAATATTA GGCTAGCTACAACGA ACATTTGG 


3161 


3711 


AAUGUGUA A UAUUCCAG 


834 


CTGGAATA GGCTAGCTACAACGA TACACATT 


3162 


3713 


UGUGUAAU A UUCCAGUU 


835 


AACTGGAA GGCTAGCTACAACGA ATTACACA 


3163 


3719 


AUAUUCCA G UUUUCUCU 


836 


AGAGAAAA GGCTAGCTACAACGA TGGAATAT 


3164 


3728 


UUUUCUCU G CAUAAGUA 


837 


TACTTATG GGCTAGCTACAACGA AGAGAAAA 


3165 


3730 


UUCUCUGC A UAAGUAAU 


838 


ATTACTTA GGCTAGCTACAACGA GCAGAGAA 


3166 


3734 


CUGCAUAA G UAAUUAAA 


839 


TTTAATTA GGCTAGCTACAACGA TTATGCAG 


3167 


3737 


CAUAAGUA A UUAAAAUA 


840 


TATTTTAA GGCTAGCTACAACGA TACTTATG 


3168 


3743 


UAAUUAAA A UAUACUUA 


841 


TAAGTATA GGCTAGCTACAACGA TTTAATTA 


3169 


3745 


AUUAAAAU A UACUUAAA 


842 


TTTAAGTA GGCTAGCTACAACGA ATTTTAAT 


3170 


3747 


UAAAAUAU A CUUAAAAA 


843 


TTTTTAAG GGCTAGCTACAACGA ATATTTTA 


3171 


3755 


ACUUAAAA A UUAAUAGU 


844 


ACTATTAA GGCTAGCTACAACGA TTTTAAGT 


3172 


3759 


AAAAAUUA A UAGUUUUA 


845 


TAAAACTA GGCTAGCTACAACGA TAATTTTT 


3173 


3762 


AAUUAAUA G UUUUAUCU 


846 


AGATAAAA GGCTAGCTACAACGA TATTAATT 


3174 


3767 


AUAGUUUU A UCUGGGUA 


847 


TACCCAGA GGCTAGCTACAACGA AAAACTAT 


3175 


3773 


UUAUCUGG G UACAAAUA 


848 


TATTTGTA GGCTAGCTACAACGA CCAGATAA 


3176 


3775 


AUCUGGGU A CAAAUAAA 


849 , 


TTTATTTG GGCTAGCTACAACGA ACCCAGAT 


3177 


3779 


GGGUACAA A UAAACAGU 


850 


ACTGTTTA GGCTAGCTACAACGA TTGTACCC 


3178 


3783 


ACAAAUAA A CAGUGCCU 


851 


AGGCACTG GGCTAGCTACAACGA TTATTTGT 


3179 


3786 


AAUAAACA G UGCCUGAA 


852 


TTCAGGCA GGCTAGCTACAACGA TGTTTATT 


3180 


3788 


UAAACAGU G CCUGAACU 


853 


AGTTCAGG GGCTAGCTACAACGA ACTGTTTA 


3181 


3794 


GUGCCUGA A CUAGUUCA 


854 


TGAACTAG GGCTAGCTACAACGA TCAGGCAC 


3182 


3798 


CUGAACUA G UUCACAGA 


855 


TCTGTGAA GGCTAGCTACAACGA TAGTTCAG 


3183 


3802 


ACUAGUUC A CAGACAAG 


856 


CTTGTCTG GGCTAGCTACAACGA GAACTAGT 


3184 


3806 


GUUCACAG A CAAGGGAA 


857 


TTCCCTTG GGCTAGCTACAACGA CTGTGAAC 


3185 


3815 


CAAGGGAA A CUUCUAUG 


858 


CATAGAAG GGCTAGCTACAACGA TTCCCTTG 


3186 


3821 


AAACUUCU A UGUAAAAA 


859 


TTTTTACA GGCTAGCTACAACGA AGAAGTTT 


3187 


3823 


ACUUCUAU G UAAAAAUC 


860 


GATTTTTA GGCTAGCTACAACGA ATAGAAGT 


3188 


3829 


AUGUAAAA A UCACUAUG 


861 


CATAGTGA GGCTAGCTACAACGA TTTTACAT 


3189 


3832 


UAAAAAUC A CUAUGAUU 


862 


AATCATAG GGCTAGCTACAACGA GATTTTTA 


3190 


3835 


AAAUCACU A UGAUUUCU 


863 


AGAAATCA GGCTAGCTACAACGA AGTGATTT 


3191 


3838 


UCACUAUG A UUUCUGAA 


864 


TTCAGAAA GGCTAGCTACAACGA CATAGTGA 


3192 


3846 


AUUUCUGA A UUGCUAUG 


865 


CATAGCAA GGCTAGCTACAACGA TCAGAAAT 


3193 


3849 


UCUGAAUU G CUAUGUGA 


B66 


TC AC AT AG GGCTAGCTACAACGA AATTCAGA 


3194 


3852 


GAAUUGCU A UGUGAAAC 


867 


GTTTCACA GGCTAGCTACAACGA AGCAATTC 


3195 


3854 


AUUGCUAU G UGAAACUA 


868 


TAGTTTCA GGCTAGCTACAACGA ATAGCAAT 


3196 


3859 


UAUGUGAA A CUACAGAU 


869 


ATCTGTAG GGCTAGCTACAACGA TTCACATA 


3197 


3862 


GUGAAACU A CAGAUCUU 


870 


AAGATCTG GGCTAGCTACAACGA AGTTTCAC 


3198 


3866 


AACUACAG A UCUUUGGA 


871 


TCCAAAGA GGCTAGCTACAACGA CTGTAGTT 


3199 


3875 


UCUUUGGA A CACUGUUU 


872 


AAACAGTG GGCTAGCTACAACGA TCCAAAGA 


3200 


3877 


UUUGGAAC A CUGUUUAG 


873 


CTAAACAG GGCTAGCTACAACGA GTTCCAAA 


3201 


3880 


GGAACACU G UUUAGGUA 


874 


TACCTAAA GGCTAGCTACAACGA AGTGTTCC 


3202 


3886 


CUGUUUAG G UAGGGUGU 


875 


ACACCCTA GGCTAGCTACAACGA CTAAACAG 


3203 


3891 


UAGGUAGG G UGUUAAGA 


876 


TCTTAACA GGCTAGCTACAACGA CCTACCTA 


3204 


3893 


GGUAGGGU G UUAAGACU 


877 


AGTCTTAA GGCTAGCTACAACGA ACCCTACC 


3205 


3899 


GUGUUAAG A CUUGACAC 


878 


GTGTCAAG GGCTAGCTACAACGA CTTAACAC 


3206 


3904 


AAGACUUG A CACAGUAC 


879 


GTACTGTG GGCTAGCTACAACGA CAAGTCTT 


3207 


3906 


GACUUGAC A CAGUACCU 


880 


AGGTACTG GGCTAGCTACAACGA GTCAAGTC 


3208 
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3909 


UUGACACA G UACCUCGU 


881 


ACGAGGTA GGCTAGCTACAACGA TGTGTCAA 


3209 


3911 


GACACAGU A CCUCGUUU 


882 


AAACGAGG GGCTAGCTACAACGA ACTGTGTC 


3210 


3916 


AGUACCUC G UUUCUACA 


883 


TGTAGAAA GGCTAGCTACAACGA GAGGTACT 


3211 


3922 


UCGUUUCU A CACAGAGA 


884 


TCTCTGTG GGCTAGCTACAACGA AGAAACGA 


3212 


3924 


GUUUCUAC A CAGAGAAA 


885 


TTTCTCTG GGCTAGCTACAACGA GTAGAAAC 


3213 


3936 


AGAAAGAA A UGGCCAUA 


886 


TATGGCCA GGCTAGCTACAACGA TTCTTTCT 


3214 


3939 


AAGAAAUG G CCAUACUU 


887 


AAGTATGG GGCTAGCTACAACGA CATTTCTT 


3215 


3942 


AAAUGGCC A UACUUCAG 


888 


CTGAAGTA GGCTAGCTACAACGA GGCCATTT 


3216 [ 


3944 


AUGGCCAU A CUUCAGGA 


889 


TCCTGAAG GGCTAGCTACAACGA ATGGCCAT 


3217 


3953 


CUUCAGGA A CUGCAGUG 


890 


CACTGCAG GGCTAGCTACAACGA TCCTGAAG 


3218 


3956 


CAGGAACU G CAGUGCUU 


891 


AAGCACTG GGCTAGCTACAACGA AGTTCCTG 


3219 


3959 


GAACUGCA G UGCUUAUG 


892 


CATAAGCA GGCTAGCTACAACGA TGCAGTTC 


3220 


3961 


ACUGCAGU G CUUAUGAG 


893 


CTCATAAG GGCTAGCTACAACGA ACTGCAGT 


3221 ! 


3965 


CAGUGCUU A UGAGGGGA 


894 


TCCCCTCA GGCTAGCTACAACGA AAGCACTG 


3222 


3973 


AUGAGGGG A UAUUUAGG 


895 


CCTAAATA GGCTAGCTACAACGA CCCCTCAT 


3223 


3975 


GAGGGGAU A UUUAGGCC 


896 


GGCCTAAA GGCTAGCTACAACGA ATCCCCTC 


3224 


3981 


AUAUUUAG G CCUCUUGA 


897 


TCAAGAGG GGCTAGCTACAACGA CTAAATAT 


3225 


3990 


CCUCUUGA A UUUUUGAU 


898 


ATCAAAAA GGCTAGCTACAACGA TCAAGAGG 


3226 


3997 


AAUUUUUG A UGUAGAUG 


899 


CATCTACA GGCTAGCTACAACGA CAAAAATT 


3227 


3999 


UUUUUGAU G UAGAUGGG 


900 


CCCATCTA GGCTAGCTACAACGA ATCAAAAA 


3228 


4003 


UGAUGUAG A UGGGCAUU 


901 


AATGCCCA GGCTAGCTACAACGA CTACATCA 


3229 


4007 


GUAGAUGG G GAUUUUUU 


902 


AAAAAATG GGCTAGCTACAACGA CCATCTAC 


3230 


4009 


AGAUGGGC A UUUUUUUA 


903 


TAAAAAAA GGCTAGCTACAACGA GCCCATCT 


3231 


4020 


UUUUUAAG G UAGUGGUU 


904 


AACCACTA GGCTAGCTACAACGA CTTAAAAA 


3232 


4023 


UUAAGGUA G UGGUUAAU 


905 


ATTAACCA GGCTAGCTACAACGA TACCTTAA 


3233 


4026 


AGGUAGUG G UUAAUUAC 


906 


GTAATTAA GGCTAGCTACAACGA CACTACCT 


3234 


4030 


AGUGGUUA A UUACCUUU 


907 


AAAGGTAA GGCTAGCTACAACGA TAACCACT 


3235 


4033 


GGUUAAUU A CCUUUAUG 


908 


CATAAAGG GGCTAGCTACAACGA AATTAACC 


3236 


4039 


UUACCUUU A UGUGAACU 


909 


AGTTCACA GGCTAGCTACAACGA AAAGGTAA 


3237 


4041 


ACCUUUAU G UGAACUUU 


910 


AAAGTTCA GGCTAGCTACAACGA ATAAAGGT 


3238 


4045 


UUAUGUGA A CUUUGAAU 


911 


ATTCAAAG GGCTAGCTACAACGA TCACATAA 


3239 


4052 


AACUUUGA A UGGUUUAA 


912 


TTAAACCA GGCTAGCTACAACGA TCAAAGTT 


3240 


4055 


UUUGAAUG G UUUAACAA 


913 


TTGTTAAA GGCTAGCTACAACGA CATTCAAA 


3241 


4060 


AUGGUUUA A CAAAAGAU 


914 


ATCTTTTG GGCTAGCTACAACGA TAAACCAT 


3242 


4067 


AACAAAAG A UUUGUUUU 


915 


AAAACAAA GGCTAGCTACAACGA CTTTTGTT 


3243 


4071 


AAAGAUUU G UUUUUGUA 


916 1 


TACAAAAA GGCTAGCTACAACGA AAATCTTT 


3244 


4077 


UUGUUUUU G UAGAGAUU 


917 


AATCTCTA GGCTAGCTACAACGA AAAAACAA 


3245 


4083 


UUGUAGAG A UUUUAAAG 


918 


CTTTAAAA GGCTAGCTACAACGA CTCTACAA 


3246 


4099 


GGGGGAGA A UUCUAGAA 


919 


TTCTAGAA GGCTAGCTACAACGA TCTCCCCC 


3247 


4108 


UUCUAGAA A UAAAUGUU 


920 


AACATTTA GGCTAGCTACAACGA TTCTAGAA 


3248 


4112 


AGAAAUAA A UGUUACCU 


921 


AGGTAACA GGCTAGCTACAACGA TTATTTCT 


3249 


4114 


AAAUAAAU G UUACCUAA 


922 


TTAGGTAA GGCTAGCTACAACGA ATTTATTT 


3250 


4117 


UAAAUGUU A CCUAAUUA 


923 


TAATTAGG GGCTAGCTACAACGA AACATTTA 


3251 


4122 


GUUACCUA A UUAUUACA 


924 


TGTAATAA GGCTAGCTACAACGA TAGGTAAC 


3252 


4125 


ACCUAAUU A UUACAGCC 


925 


GGCTGTAA GGCTAGCTACAACGA AATTAGGT 


3253 


4128 


UAAUUAUU A CAGCCUUA 


926 


TAAGGCTG GGCTAGCTACAACGA AATAATTA 


3254 


4131 


UUAUUACA G CCUUAAAG 


927 


CTTTAAGG GGCTAGCTACAACGA TGTAATAA 


3255 


4140 


CCUUAAAG A CAAAAAUC 


928 


GATTTTTG GGCTAGCTACAACGA CTTTAAGG 


3256 


4146 


AGACAAAA A UCCUUGUU 


929 


AACAAGGA GGCTAGCTACAACGA TTTTGTCT 


3257 


4152 


AAAUCCUU G UUGAAGUU 


930 


AACTTCAA GGCTAGCTACAACGA AAGGATTT 


3258 


4158 


UUGUUGAA G UUUUUUUA 


931 


TAAAAAAA GGCTAGCTACAACGA TTCAACAA 


3259 


4174 


AAAAAAAG A CUAAAUUA 


932 


TAATTTAG GGCTAGCTACAACGA CTTTTTTT 


3260 
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4179 


AAGACUAA A UUACAUAG 


933 


CTATGTAA GGCTAG CTACAACGA TTAGTCTT 


3261 


4182 


ACUAAAUU A CAUAGACU 


934 


AGTCTATG GGCTAGCTACAACGA AATTTAGT 


3262 


4184 


UAAAUUAC A UAGACUUA 


935 


TAAGTCTA GGCTAGCTACAACGA GTAATTTA 


3263 


4188 


UUACAUAG A CUUAGGCA 


936 


TGCCTAAG GGCTAGCTACAACGA CTATGTAA 


3264 


4194 


AGACUUAG G CAUUAACA 


937 


TGTTAATG GGCTAGCTACAACGA CTAAGTCT 


3265 


4196 


ACUUAGGC A UUAACAUG 


938 


CATGTTAA GGCTAGCTACAACGA GCCTAAGT 


3266 


4200 


AGGCAUUA A CAUGUUUG 


939 


CAAACATG GGCTAGCTACAACGA TAATGCCT 


3267 


4202 


GCAUUAAC A UGUUUGUG 


940 


CACAAACA GGCTAGCTACAACGA GTTAATGC 


3268 


4204 


AUUAACAU G UUUGUGGA 


941 


TCCACAAA GGCTAGCTACAACGA ATGTTAAT 


3269 


4208 


ACAUGUUU G UGGAAGAA 


942 


TTCTTCCA GGCTAGCTACAACGA AAACATGT 


3270 


4216 


GUGGAAGA A UAUAGCAG 


943 


CTGCTATA GGCTAGCTACAACGA TCTTCCAC 


3271 


4218 


GGAAGAAU A UAGCAGAC 


944 


GTCTGCTA GGCTAGCTACAACGA ATTCTTCC 


3272 


4221 


AGAAUAUA G CAGACGUA 


945 


TACGTCTG GGCTAGCTACAACGA TATATTCT 


3273 


4225 


UAUAGCAG A CGUAUAUU 


946 


AATATACG GGCTAGCTACAACGA CTGCTATA 


3274 


4227 


UAGCAGAC G UAUAUUGU 


947 


ACAATATA GGCTAGCTACAACGA GTCTGCTA 


3275 


4229 


GCAGACGU A UAUUGUAU 


948 


ATACAATA GGCTAGCTACAACGA ACGTCTGC 


3276 


4231 


AGACGUAU A UUGUAUCA 


949 


TGATACAA GGCTAGCTACAACGA ATACGTCT 


3277 | 


4234 


CGUAUAUU G UAUCAUUU 


950 


AAATGATA GGCTAGCTACAACGA AATATACG 


3278 


4236 


UAUAUUGU A UCAUUUGA 


951 


TCAAATGA GGCTAGCTACAACGA ACAATATA 


3279 


4239 


AUUGUAUC A UUUGAGUG 


952 


CACTCAAA GGCTAGCTACAACGA GATACAAT 


3280 


4245 


UCAUUUGA G UGAAUGUU 


953 


AACATTCA GGCTAGCTACAACGA TCAAATGA 


3281 


4249 


UUGAGUGA A UGUUCCCA 


954 


TGGGAACA GGCTAGCTACAACGA TCACTCAA 


3282 


4251 


GAGUGAAU G UUCCCAAG 


955 


CTTGGGAA GGCTAGCTACAACGA ATTCACTC 


3283 


4259 


GUUCCCAA G UAGGCAUU 


956 


AATGCCTA GGCTAGCTACAACGA TTGGGAAC 


3284 


4263 


CCAAGUAG G CAUUCUAG 


957 


CTAGAATG GGCTAGCTACAACGA CTACTTGG 


3285 


4265 


AAGUAGGC A UUCUAGGC 


958 


GCCTAGAA GGCTAGCTACAACGA GCCTACTT 


3286 


4272 


CAUUCUAG G CUCUAUUU 


959 


AAATAGAG GGCTAGCTACAACGA CTAGAATG 


3287 


4277 


UAGGCUCU A UUUAACUG 


960 


CAGTTAAA GGCTAGCTACAACGA AGAGCCTA 


3288 


4282 


UCUAUUUA A CUGAGUCA 


961 


TGACTCAG GGCTAGCTACAACGA TAAATAGA 


3289 


4287 


UUAACUGA G UCACACUG 


962 


CAGTGTGA GGCTAGCTACAACGA TCAGTTAA 


3290 


4290 


ACUGAGUC A CACUGCAU 


963 


ATGCAGTG GGCTAGCTACAACGA GACTCAGT 


3291 


4292 


UGAGUCAC A CUGCAUAG 


964 


CTATGCAG GGCTAGCTACAACGA GTGACTCA 


3292 


4295 


GUCACACU G CAUAGGAA 


965 


TTCCTATG GGCTAGCTACAACGA AGTGTGAC 


3293 


4297 


CACACUGC A UAGGAAUU 


966 


AATTCCTA GGCTAGCTACAACGA GCAGTGTG 


3294 


4303 


GCAUAGGA A UUUAGAAC 


967 


GTTCTAAA GGCTAGCTACAACGA TCCTATGC 


3295 


4310 


AAUUUAGA A CCUAACUU 


968 


AAGTTAGG GGCTAGCTACAACGA TCTAAATT 


3296 


4315 


AGAACCUA A CUUUUAUA 


969 


TATAAAAG GGCTAGCTACAACGA TAGGTTCT 


3297 


4321 


UAACUUUU A UAGGUUAU 


970 


ATAACCTA GGCTAGCTACAACGA AAAAGTTA 


3298 


4325 


UUUUAUAG G UUAUCAAA 


971 


TTTGATAA GGCTAGCTACAACGA CTATAAAA 


3299 


4328 


UAUAGGUU A UCAAAACU 


972 


AGTTTTGA GGCTAGCTACAACGA AACCTATA 


3300 


4334 


UUAUCAAA A CUGUUGUC 


973 


GACAACAG GGCTAGCTACAACGA TTTGATAA 


3301 


4337 


UCAAAACU G UUGUCACC 


974 


GGTGACAA GGCTAGCTACAACGA AGTTTTGA 


3302 


4340 


AAACUGUU G UCACCAUU 


975 


AATGGTGA GGCTAGCTACAACGA AACAGTTT 


3303 


4343 


CUGUUGUC A CCAUUGCA 


976 


TGCAATGG GGCTAGCTACAACGA GACAACAG 


3304 


4346 


UUGUCACC A UUGCACAA 


977 


TTGTGCAA GGCTAGCTACAACGA GGTGACAA 


3305 


4349 


UCACCAUU G CACAAUUU 


978 


AAATTGTG GGCTAGCTACAACGA AATGGTGA 


3306 


4351 


ACCAUUGC A CAAUUUUG 


979 


CAAAATTG GGCTAGCTACAACGA GCAATGGT 


3307 


4354 


AUUGCACA A UUUUGUCC 


980 


GGACAAAA GGCTAGCTACAACGA TGTGCAAT 


3308 


4359 


ACAAUUUU G UCCUAAUA 


981 


TATTAGGA GGCTAGCTACAACGA AAAATTGT 


3309 


4365 


UUGUCCUA A UAUAUACA 


982 


TGTATATA GGCTAGCTACAACGA TAGGACAA 


3310 


4367 


GUCCUAAU A UAUACAUA 


983 


TATGTATA GGCTAGCTACAACGA ATTAGGAC 


3311 


4369 


CCUAAUAU A UACAUAGA 


984 


TCTATGTA GGCTAGCTACAACGA ATATTAGG 


3312 
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4371 


UAAUAUAU A CAUAGAAA 


985 


TTTCTATG GGCT AG CTACAACGA ATATATTA 


3313 


4373 


AUAUAUAC A UAGAAACU 


986 


AGTTTCTA GGCTAG CTACAACGA GTATATAT 


3314 


4379 


ACAUAGAA A CUUUGUGG 


987 


CCACAAAG GGCTAG CTACAACGA TTCTATGT 


3315 


4384 


GAAACUUU G UGGGGCAU 


988 


ATGCCCCA GGCTAG CTACAACGA AAAGTTTC 


3316 


4389 


UUUGUGGG G CAUGUUAA 


989 


TTAACATG GGCTAG CTACAACGA CCCACAAA 


3317 


4391 


UGUGGGGC A UGUUAAGU 


990 


ACTTAACA GGCTAGCTACAACGA GCCCCACA 


3318 


4393 


UGGGGCAU G UUAAGUUA 


991 


TAACTTAA GGCTAGCTACAACGA ATGCCCCA 


3319 


4398 


CAUGUUAA G UUACAGUU 


992 


AACTGTAA GGCTAGCTACAACGA TTAACATG 


3320 


4401 


GUUAAGUU A CAGUUUGC 


993 


GCAAACTG GGCTAGCTACAACGA AACTTAAC 


3321 


4404 


AAGUUACA G UUUGCACA 


994 


TGTGCAAA GGCTAGCTACAACGA TGTAACTT 


3322 


4408 


UACAGUUU G CACAAGUU 


995 


AACTTGTG GGCTAGCTACAACGA AAACTGTA 


3323 


4410 


CAGUUUGC A CAAGUUCA 


996 


TGAACTTG GGCTAGCTACAACGA GCAAACTG 


3324 


4414 


UUGCACAA G UUCAUCUC 


997 


GAGATGAA GGCTAGCTACAACGA TTGTGCAA 


3325 


4418 


ACAAGUUC A UCUCAUUU 


998 


AAATGAGA GGCTAGCTACAACGA GAACTTGT 


3326 


4423 


UUCAUCUC A UUUGUAUU 


999 


AATACAAA GGCTAGCTACAACGA GAGATGAA 


3327 


4427 


UCUCAUUU G UAUUCCAU 


1000- 


ATGGAATA GGCTAGCTACAACGA AAATGAGA 


3328 


4429 


UCAUUUGU A UUCCAUUG 


1001 


CAATGGAA GGCTAGCTACAACGA ACAAATGA 


3329 


4434 


UGUAUUCC A UUGAUUUU 


1002 


AAAATCAA GGCTAGCTACAACGA GGAATACA 


3330 


4438 


UUCCAUUG A UUUUUUUU 


1003 


AAAAAAAA GGCTAGCTACAACGA CAATGGAA 


3331 


4457 


UCUUCUAA A CAUUUUUU 


1004 


AAAAAATG GGCTAGCTACAACGA TTAGAAGA 


3332 


4459 


UUCUAAAC A UUUUUUCU 


1005 


AGAAAAAA GGCTAGCTACAACGA GTTTAGAA 


3333 


4473 


UCUUCAAA A CAGUAUAU 


1006 


ATATACTG GGCTAGCTACAACGA TTTGAAGA 


3334 


4476 


UCAAAACA G UAUAUAUA 


1007 


TATATATA GGCTAGCTACAACGA TGTTTTGA 


3335 


4478 


AAAACAGU A UAUAUAAC 


1008 


GTTATATA GGCTAGCTACAACGA ACTGTTTT 


3336 


4480 


AACAGUAU A UAUAACUU 


1009 


AAGTTATA GGCTAGCTACAACGA ATACTGTT 


3337 


4482 


CAGUAUAU A UAACUUUU 


1010 


AAAAGTTA GGCTAGCTACAACGA ATATACTG 


3338 


4485 


UAUAUAUA A CUUUUUUU 


1011 


AAAAAAAG GGCTAGCTACAACGA TATATATA 


3339 


4499 


UUUAGGGG A UUUUUUUU 


1012 


AAAAAAAA GGCTAGCTACAACGA CCCCTAAA 


3340 


4510 


UUUUUUAG A CAGCAAAA 


1013 


TTTTGCTG GGCTAGCTACAACGA CTAAAAAA 


3341 


4513 


UUUAGACA G CAAAAAAC 


1014 


GTTTTTTG GGCTAGCTACAACGA TGTCTAAA 


3342 


4520 


AGCAAAAA A CUAUCUGA 


1015 


TCAGATAG GGCTAGCTACAACGA TTTTTGCT 


3343 


4523 


AAAAAACU A UCUGAAGA 


1016 


TCTTCAGA GGCTAGCTACAACGA AGTTTTTT 


3344 


4531 


AUCUGAAG A TJUUCCAUU 


1017 


AATGGAAA GGCTAGCTACAACGA CTTCAGAT 


3345 


4537 


AGAUUUCC A UUUGUCAA 


1018 


TTGACAAA GGCTAGCTACAACGA GGAAATCT 


3346 


4541 


UUCCAUUU G UCAAAAAG 


1019 


CTTTTTGA GGCTAGCTACAACGA AAATGGAA 


3347 


4549 


GUCAAAAA G UAAUGAUU 


1020 


AATCATTA GGCTAGCTACAACGA TTTTTGAC 


3348 


4552 


AAAAAGUA A UGAUUUCU 


1021 


AGAAATCA GGCTAGCTACAACGA TACTTTTT 


3349 


4555 


AAGUAAUG A UUUCUUGA 


1022 


TCAAGAAA GGCTAGCTACAACGA CATTACTT 


3350 


4563 


AUUUCUUG A UAAUUGUG 


1023 


CACAATTA GGCTAGCTACAACGA CAAGAAAT 


3351 


4566 


UCUUGAUA A UUGUGUAG 


1024 


CTACACAA GGCTAGCTACAACGA TATCAAGA 


3352 


4569 


UGAUAAUU G UGUAGUGA 


1025 


TCACTACA GGCTAGCTACAACGA AATTATCA 


3353 


4571 


AUAAUUGU G UAGUGAAU 


1026 


ATTCACTA GGCTAGCTACAACGA ACAATTAT 


3354 


4574 


AUUGUGUA G UGAAUGUU 


1027 


AACATTCA GGCTAGCTACAACGA TACACAAT 


3355 


4578 


UGUAGUGA A UGUUUUUU 


1028 


AAAAAACA GGCTAGCTACAACGA TCACTACA 


3356 


4580 


UAGUGAAU G UUUUUUAG 


1029 


CTAAAAAA GGCTAGCTACAACGA ATTCACTA 


3357 


4590 


UUUUUAGA A CCCAGCAG 


1030 


CTGCTGGG GGCTAGCTACAACGA TCTAAAAA 


3358 


4595 


AGAACCCA G CAGUUACC 


1031 


GGTAACTG GGCTAGCTACAACGA TGGGTTCT 


3359 


4598 


ACCCAGCA G UUACCUUG 


1032 


CAAGGTAA GGCTAGCTACAACGA TGCTGGGT 


3360 


4601 


CAGCAGUU A CCUUGAAA 


1033 


TTTCAAGG GGCTAGCTACAACGA AACTGCTG 


3361 


4610 


CCUUGAAA G CUGAAUUU 


1034 


AAATTCAG GGCTAGCTACAACGA TTTCAAGG 


3362 


4615 


AAAGCUGA A UUUAUAUU 


1035 


AATATAAA GGCTAGCTACAACGA TCAGCTTT 


3363 


4619 


CUGAAUUU A UAUUUAGU 


1036 


ACTAAATA GGCTAGCTACAACGA AAATTCAG 


3364 
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4621 


GAAUUUAU A UUUAGUAA 


1037 


TTACTAAA GG CTAG CTAC AACGA ATAAATTC 


3365 


4626 


UAUAUUUA G UAACUUCU 


1038 


AGAAGTTA GGCTAGCTACAACGA TAAATATA 


3366 


4629 


AUUUAGUA A CUUCUGUG 


1039 


CACAGAAG GGCTAGCTACAACGA TACTAAAT 


3367 


4635 


UAACUUCU G UGUUAAUA 


1040 


TATTAACA GGCTAGCTACAACGA AGAAGTTA 


3368 


4637 


ACUUCUGU G UUAAUACU 


1041 


AGTATTAA GGCTAGCTACAACGA ACAGAAGT 


3369 


4641 


CUGUGUUA A UACUGGAU 


1042 


ATCCAGTA GGCTAGCTACAACGA TAACACAG 


3370 


4643 


GUGUUAAU A CUGGAUAG 


1043 


CTATCCAG GGCTAGCTACAACGA ATTAACAC 


3371 


4648 


AAUACUGG A UAGCAUGA 


1044 


TCATGCTA GGCTAGCTACAACGA CCAGTATT 


3372 


4651 


ACUGGAUA G CAUGAAUU 


1045 


AATTCATG GGCTAGCTACAACGA TATCCAGT 


3373 


4653 


UGGAUAGC A UGAAUUCU 


1046 


AGAATTCA GGCTAGCTACAACGA GCTATCCA 


3374 


4657 


UAGCAUGA A UUCUGCAU 


1047 


ATGCAGAA GGCTAGCTACAACGA TCATGCTA 


3375 


4662 


UGAAUUCU G CAUUGAGA 


1048 


TCTCAATG GGCTAGCTACAACGA AGAATTCA 


3376 


4664 


AAUUCUGC A UUGAGAAA 


1049 


TTTCTCAA GGCTAGCTACAACGA GCAGAATT 


3377 


4672 


AUUGAGAA A CUGAAUAG 


1050 


CTATTCAG GGCTAGCTACAACGA TTCTCAAT 


■3378 


4677 


GAAACUGA A UAGCUGUC 


1051 


GACAGCTA GGCTAGCTACAACGA TCAGTTTC 


3379 


4680 


ACUGAAUA G CUGUCAUA 


1052 


TATGACAG GGCTAGCTACAACGA TATTCAGT 


3380 


4683 


GAAUAGCU G UCAUAAAA 


1053 


TTTTATGA GGCTAGCTACAACGA AGCTATTC 


3381 


4686 


UAGCUGUC A UAAAAUGC 


1054 


GCATTTTA GGCTAGCTACAACGA GACAGCTA 


3382 


4691 


GUCAUAAA A UGCUUUCU 


1055 


AGAAAGCA GGCTAGCTACAACGA TTTATGAC 


3383 ! 


4693 


CAUAAAAU G CUUUCUUU 


1056 


AAAGAAAG GGCTAGCTACAACGA ATTTTATG 


3384 


4713 


AAAGAAAG A UACUCACA 


1057 


TGTGAGTA GGCTAGCTACAACGA CTTTCTTT 


3385 


4715 


AGAAAGAU A CUCACAUG 


1058 


CATGTGAG GGCTAGCTACAACGA ATCTTTCT 


3386 


4719 


AGAUACUC A CAUGAGUU 


1059 


AACTCATG GGCTAGCTACAACGA GAGTATCT 


3387 


4721 


AUACUCAC A UGAGUUCU 


1060 


AGAACTCA GGCTAGCTACAACGA GTGAGTAT 


3388 


4725 


UCACAUGA G UUCUUGAA 


1061 


TTCAAGAA GGCTAGCTACAACGA TCATGTGA 


3389 


4736 


CUUGAAGA A UAGUCAUA 


1062 


TATGACTA GGCTAGCTACAACGA TCTTCAAG 


3390 


4739 


GAAGAAUA G UCAUAACU 


1063 


AGTTATGA GGCTAGCTACAACGA TATTCTTC 


3391 


4742 


GAAUAGUC A UAACUAGA 


1064 


TCTAGTTA GGCTAGCTACAACGA GACTATTC 


3392 


4745 


UAGUCAUA A CUAGAUUA 


1065 


TAATCTAG GGCTAGCTACAACGA TATGACTA 


3393 


4750 


AUAACUAG A UUAAGAUC 


1066 


GATCTTAA GGCTAGCTACAACGA CTAGTTAT 


3394 


4756 


AGAUUAAG A UCUGUGUU 


1067 


AACACAGA GGCTAGCTACAACGA CTTAATCT 


3395 


4760 


UAAGAUCU G UGUUUUAG 


1068 


CTAAAACA GGCTAGCTACAACGA AGATCTTA 


3396 | 


4762 


AGAUCUGU G UUUUAGUU 


1069 


AACTAAAA GGCTAGCTACAACGA ACAGATCT 


3397 


4768 


GUGUUUUA G UUUAAUAG 


1070 


CTATTAAA GGCTAGCTACAACGA TAAAACAC 


3398 


4773 


UUAGUUUA A UAGUUUGA 


1071 


TCAAACTA GGCTAGCTACAACGA TAAACTAA 


3399 


4776 


GUUUAAUA G UUUGAAGU 


1072 


ACTTCAAA GGCTAGCTACAACGA TATTAAAC 


3400 


4783 


AGUUUGAA G UGCCUGUU 


1073 


AACAGGCA GGCTAGCTACAACGA TTCAAACT i 


3401 


4785 


UUUGAAGU G CCUGUUUG 


1074 


CAAACAGG GGCTAGCTACAACGA ACTTCAAA 


3402 


4789 


AAGUGCCU G UUUGGGAU 


1075 


ATCCCAAA GGCTAGCTACAACGA AGGCACTT 


3403 


4796 


UGUUUGGG A UAAUGAUA 


1076 


TATCATTA GGCTAGCTACAACGA CCCAAACA 


3404 


4799 


UUGGGAUA A UGAUAGGU 


1077 


ACCTATCA GGCTAGCTACAACGA TATCCCAA 


3405 


4802 


GGAUAAUG A UAGGUAAU 


1078 


ATTACCTA GGCTAGCTACAACGA CATTATCC 


3406 


4806 


AAUGAUAG G UAAUUUAG 


1079 


CTAAATTA GGCTAGCTACAACGA CTATCATT 


3407 


4809 


GAUAGGUA A UUUAGAUG 


1080 


CATCTAAA GGCTAGCTACAACGA TACCTATC 


3408 


4815 


UAAUUUAG A UGAAUUUA 


1081 


TAAATTCA GGCTAGCTACAACGA CTAAATTA 


3409 


4819 


UUAGAUGA A UUUAGGGG 


1082 


CCCCTAAA GGCTAGCTACAACGA TCATCTAA 


3410 


4836 


AAAAAAAA G UUAUCUGC 


1083 


GCAGATAA GGCTAGCTACAACGA TTTTTTTT 


3411 


4839 


AAAAAGUU A UCUGCAGU 


1084 


ACTGCAGA GGCTAGCTACAACGA AACTTTTT 


3412 


4843 


AGUUAUCU G CAGUUAUG 


1085 


CATAACTG GGCTAGCTACAACGA AGATAACT 


3413 


4846 


UAUCUGCA G UUAUGUUG 


1086 


CAACATAA GGCTAGCTACAACGA TGCAGATA 


3414 


4849 


CUGCAGUU A UGUUGAGG 


1087 


CCTCAACA GGCTAGCTACAACGA AACTGCAG 


3415 


4851 


GCAGUUAU G UUGAGGGC 


1088 


GCCCTCAA GGCTAGCTACAACGA ATAACTGC 


3416 
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4858 


UGUUGAGG G CCCAUCUC 


1089 


GAGATGGG GGCTAGCTACAACGA CCTCAACA 


3417 


4862 


GAGGGCCC A UCUCUCCC 


1090 


GGGAGAGA GGCTAGCTACAACGA GGGCCCTC 


3418 


4874 


CUCCCCCC A CACCCCCA 


1091 


TGGGGGTG GGCTAGCTACAACGA GGGGGGAG 


3419 


4876 


CCCCCCAC A CCCCCACA 


1092 


TGTGGGGG GGCTAGCTACAACGA GTGGGGGG 


3420 


4882 


ACACCCCC A CAGAGCUA 


1093 


TAGCTCTG GGCTAGCTACAACGA GGGGGTGT 


3421 


4887 


CCCACAGA G CUAACUGG 


1094 


CCAGTTAG GGCTAGCTACAACGA TCTGTGGG 


3422 


4891 


CAGAGCUA A CUGGGUUA 


1095 


TAACCCAG GGCTAGCTACAACGA TAGCTCTG 


3423 


4896 


CUAACUGG G UUACAGUG 


1096 


CACTGTAA GGCTAGCTACAACGA CCAGTTAG 


3424 


4899 


ACUGGGUU A CAGUGUUU 


1097 


AAACACTG GGCTAGCTACAACGA AACCCAGT 


3425 


4902 


GGGUUACA G UGUUUUAU 


1098 


ATAAAACA GGCTAGCTACAACGA TGTAACCC 


3426 


4904 


GUUACAGU G UUUUAUCC 


1099 


GGATAAAA GGCTAGCTACAACGA ACTGTAAC 


3427 


4909 


AGUGUUUU A UCCGAAAG 


1100 


CTTTCGGA GGCTAGCTACAACGA AAAACACT 


3428 


4917 


AUCCGAAA G UUUCCAAU 


1101 


. ATTGGAAA GGCTAGCTACAACGA TTTCGGAT 


3429 


4924 


AGUUUCCA A UUCCACUG 


1102 


CAGTGGAA GGCTAGCTACAACGA TGGAAACT 


3430 


4929 


CCAAUUCC A CUGUCUUG 


1103 


CAAGACAG GGCTAGCTACAACGA GGAATTGG 


3431 


4932 


AUUCCACU G UCUUGUGU 


1104 


ACACAAGA GGCTAGCTACAACGA AGTGGAAT 


3432 


4937 


ACUGUCUU G UGUUUUCA 


1105 


TGAAAACA GGCTAGCTACAACGA AAGACAGT 


3433 


4939 


UGUCUUGU G UUUUCAUG 


1106 


CATGAAAA GGCTAGCTACAACGA ACAAGACA 


3434 


4945 


GUGUUUUC A UGUUGAAA 


1107 


TTTCAACA GGCTAGCTACAACGA GAAAACAC 


3435 


4947 


GUUUUCAU G UUGAAAAU 


1108 


ATTTTCAA GGCTAGCTACAACGA ATGAAAAC 


3436 


4954 


UGUUGAAA A UACUUUUG 


1109 


CAAAAGTA GGCTAGCTACAACGA TTTCAACA 


3437 


4956 


UUGAAAAU A CUUUUGCA 


1110 


TGCAAAAG GGCTAGCTACAACGA ATTTTCAA 


3438 


4962 


AUACUUUU G CAUUUUUC 


1111 


GAAAAATG GGCTAGCTACAACGA AAAAGTAT 


3439 


4964 


ACUUUUGC A UUUUUCCU 


1112 


AGGAAAAA GGCTAGCTACAACGA GCAAAAGT 


3440 


4977 


UCCUUUGA G UGCCAAUU 


1113 


AATTGGCA GGCTAGCTACAACGA TCAAAGGA 


3441 


4979 


CUUUGAGU G CCAAUUUC 


1114 


GAAATTGG GGCTAGCTACAACGA ACTCAAAG 


3442 


4983 


GAGUGCCA A UUUCUUAC 


1115 


GTAAGAAA GGCTAGCTACAACGA TGGCACTC 


3443 


4990 


AAUUUCUU A CUAGUACU 


1116 


AGTACTAG GGCTAGCTACAACGA AAGAAATT 


3444 


4994 


UCUUACUA G UACUAUUU 


1117 


AAATAGTA GGCTAGCTACAACGA TAGTAAGA 


3445 


4996 


UUACUAGU A CUAUUUCU 


1118 


AGAAATAG GGCTAGCTACAACGA ACTAGTAA 


3446 


4999 


CUAGUACU A UUUCUUAA 


1119 


TTAAGAAA GGCTAGCTACAACGA AGTACTAG 


3447 


5007 


AUUUCUUA A UGUAACAU 


1120 


ATGTTACA GGCTAGCTACAACGA TAAGAAAT 


3448 


5009 


UUCUUAAU G UAACAUGU 


1121 


ACATGTTA GGCTAGCTACAACGA ATTAAGAA 


3449 


5012 


UUAAUGUA A CAUGUUUA 


1122 


TAAACATG GGCTAGCTACAACGA TACATTAA 


3450 


5014 


AAUGUAAC A UGUUUACC 


1123 


GGTAAACA GGCTAGCTACAACGA GTTACATT 


3451 


5016 


UGUAACAU G UUUACCUG 


1124 


CAGGTAAA GGCTAGCTACAACGA ATGTTACA 


3452 


5020 


ACAUGUUU A CCUGGCCU 


1125 


AGGCCAGG GGCTAGCTACAACGA AAACATGT 


3453 


5025 


UUUACCUG G CCUGUCUU 


1126 


AAGACAGG GGCTAGCTACAACGA CAGGTAAA 


3454 


5029 


CCUGGCCU G UCUUUUAA 


1127 


TTAAAAGA GGCTAGCTACAACGA AGGCCAGG 


3455 


5037 


GUCUUUUA A CUAUUUUU 


1128 


AAAAATAG GGCTAGCTACAACGA TAAAAGAC 


3456 


5040 


UUUUAACU A UUUUUGUA 


1129 


TACAAAAA GGCTAGCTACAACGA AGTTAAAA 


3457 


5046 


CUAUUUUU G UAUAGUGU 


1130 


ACACTATA GGCTAGCTACAACGA AAAAATAG 


3458 


5048 


AUUUUUGU A UAGUGUAA 


1131 


TTACACTA GGCTAGCTACAACGA ACAAAAAT 


3459 


5051 


UUUGUAUA G UGUAAACU 


1132 


AGTTTACA GGCTAGCTACAACGA TATACAAA 


3460 


5053 


UGUAUAGU G UAAACUGA 


1133 


TCAGTTTA GGCTAGCTACAACGA ACTATACA 


3461 


5057 


UAGUGUAA A CUGAAACA 


1134 


TGTTTCAG GGCTAGCTACAACGA TTACACTA 


3462 


5063 


AAACUGAA A CAUGCACA 


1135 


TGTGCATG GGCTAGCTACAACGA TTCAGTTT 


3463 


5065 


ACUGAAAC A UGCACAUU 


1136 


AATGTGCA GGCTAGCTACAACGA GTTTCAGT 


3464 


5067 


UGAAACAU G CACAUUUU 


1137 


AAAATGTG GGCTAGCTACAACGA ATGTTTCA 


3465 


5069 


AAACAUGC A CAUUUUGU 


1138 


ACAAAATG GGCTAGCTACAACGA GCATGTTT 


3466 


5071 


ACAUGCAC A UUUUGUAC 


1139 


GTACAAAA GGCTAGCTACAACGA GTGCATGT 


3467 


5076 


CACAUUUU G UACAUUGU 


1140 


ACAATGTA GGCTAGCTACAACGA AAAATGTG 


3468 
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5078 


CAUUUUGU A CAUUGUGC 


1141 


GCACAATG GGCTAGCTACAACGA ACAAAATG 


3469 


5080 


UUUUGUAC A UUGUGCUU 


1142 


AAGCACAA GGCTAGCTACAACGA GTACAAAA 


3470 


5083 


UGUACAUU G UGCUUUCU 


1143 


AGAAAGCA GGCTAGCTACAACGA AATGTACA 


3471 


5085 


UACAUUGU G CUUUCUUU 


1144 


AAAGAAAG GGCTAGCTACAACGA ACAATGTA 


3472 


5095 


UUUCUUUU G UGGGUCAU 


1145 


ATGACCCA GGCTAGCTACAACGA AAAAGAAA 


3473 


5099 


UUUUGUGG G UCAUAUGC 


1146 


GCATATGA GGCTAGCTACAACGA CCACAAAA 


3474 


5102 


UGUGGGUC A UAUGCAGU 


1147 


ACTGCATA GGCTAGCTACAACGA GACCCACA 


3475 


5104 


UGGGUCAU A UGCAGUGU 


1148 


ACACTGCA GGCTAGCTACAACGA ATGACCCA 


3476 


5106 


GGUCAUAU G CAGVGVGA 


1149 


TCACACTG GGCTAGCTACAACGA ATATGACC 


3477 


5109 


CAUAUGCA G UGUGAUCC 


1150 


GGATCACA GGCTAGCTACAACGA TGCATATG 


3478 


5111 


UAUGCAGU G UGAUCCAG 


1151 


CTGGATCA GGCTAGCTACAACGA ACTGCATA 


3479 


5114 


GCAGUGUG A UCCAGUUG 


1152 


CAACTGGA GGCTAGCTACAACGA CACACTGC 


3480 


5119 


GUGAUCCA G UUGUUUUC 


1153 


GAAAACAA GGCTAGCTACAACGA TGGATCAC 


3481 


5122 


AUCCAGUU G UUUUCCAU 


1154 


ATGGAAAA GGCTAGCTACAACGA AACTGGAT 


3482 


5129 


UGUUUUCC A UCAUUUGG 


1155 


CCAAATGA GGCTAGCTACAACGA GGAAAACA 


3483 


5132 


UUUCCAUC A UUUGGUUG 


1156 


CAACCAAA GGCTAGCTACAACGA GATGGAAA 


3484 


5137 


AUCAUUUG G UUGCGCUG 


1157 


CAGCGCAA GGCTAGCTACAACGA CAAATGAT 


3485 


5140 


AUUUGGUU G CGCUGACC 


1158 


GGTCAGCG GGCTAGCTACAACGA AACCAAAT 


3486 


5142 


UUGGUUGC G CUGACCUA 


1159 


TAGGTCAG GGCTAGCTACAACGA GCAACCAA 


3487 


5146 


UUGCGCUG A CCUAGGAA 


1160 


TTCCTAGG GGCTAGCTACAACGA CAGCGCAA 


3488 


5154 


ACCUAGGA A UGUUGGUC 


1161 


GACCAACA GGCTAGCTACAACGA TCCTAGGT 


3489 


5156 


CUAGGAAU G UUGGUCAU 


1162 


ATGACCAA GGCTAGCTACAACGA ATTCCTAG 


3490 


5160 


GAAUGUUG G UCAUAUCA 


1163 


TGATATGA GGCTAGCTACAACGA CAACATTC 


3491 


5163 


UGUUGGUC A UAUCAAAC 


1164 


GTTTGATA GGCTAGCTACAACGA GACCAACA 


3492 


5165 


UUGGUCAU A UCAAACAU 


1165 


ATGTTTGA GGCTAGCTACAACGA ATGACCAA 


3493 


5170 


CAUAUCAA A CAUUAAAA 


1166 


TTTTAATG GGCTAGCTACAACGA TTGATATG 


3494 


5172 


UAUCAAAC A UUAAAAAU 


1167 


ATTTTTAA GGCTAGCTACAACGA GTTTGATA 


3495 


5179 


CAUUAAAA A UGACCACU 


1168 


AGTGGTCA GGCTAGCTACAACGA TTTTAATG 


3496 


5182 


UAAAAAUG A CCACUCUU 


1169 


AAGAGTGG GGCTAGCTACAACGA CATTTTTA 


3497 


5185 


AAAUGACC A CUCUUUUA 


1170 


TAAAAGAG GGCTAGCTACAACGA GGTCATTT 


3498 


5194 


CUCUUUUA A UGAAAUUA 


1171 


TAATTTCA GGCTAGCTACAACGA TAAAAGAG 


3499 


5199 


UUAAUGAA A UUAACUUU 


1172 


AAAGTTAA GGCTAGCTACAACGA TTCATTAA 


3500 


5203 


UGAAAUUA A CUUUUAAA 


1173 


TTTAAAAG GGCTAGCTACAACGA TAATTTCA 


3501 


5211 


ACUUUUAA A UGUUUAUA 


1174 


TATAAACA GGCTAGCTACAACGA TTAAAAGT 


3502 


5213 


UUUUAAAU G UUUAUAGG 


1175 


CCTATAAA GGCTAGCTACAACGA ATTTAAAA 


3503 


5217 


AAAUGUUU A UAGGAGUA 


1176 


TACTCCTA GGCTAGCTACAACGA AAACATTT 


3504 


5223 


UUAUAGGA G UAUGUGCU 


1177 


AGCACATA GGCTAGCTACAACGA TCCTATAA 


3505 


5225 


AUAGGAGU A UGUGCUGU 


1178 


ACAGCACA GGCTAGCTACAACGA ACTCCTAT 


3506 


5227 


AGGAGUAU G UGCUGUGA 


1179 


TCACAGCA GGCTAGCTACAACGA ATACTCCT 


3507 


5229 


GAGUAUGU G CUGUGAAG 


1180 


CTTCACAG GGCTAGCTACAACGA ACATACTC 


3508 


5232 


UAUGUGCU G UGAAGUGA 


1181 


TCACTTCA GGCTAGCTACAACGA AGCACATA 


3509 


5237 


GCUGUGAA G UGAUCUAA 


1182 


TTAGATCA GGCTAGCTACAACGA TTCACAGC 


3510 


5240 


GUGAAGUG A UCUAAAAU 


1183 


ATTTTAGA GGCTAGCTACAACGA CACTTCAC 


3511 


5247 


GAUCUAAA A UUUGUAAU 


1184 


ATTACAAA GGCTAGCTACAACGA TTTAGATC 


3512 


5251 


UAAAAUUU G UAAUAUUU 


1185 


AAATATTA GGCTAGCTACAACGA AAATTTTA 


3513 


5254 


AAUUUGUA A UAUUUUUG 


1186 


CAAAAATA GGCTAGCTACAACGA TACAAATT 


3514 


5256 


UUUGUAAU A UUUUUGUC 


1187 


GACAAAAA GGCTAGCTACAACGA ATTACAAA 


3515 


5262 


AUAUUUUU G UCAUGAAC 


1188 


GTTCATGA GGCTAGCTACAACGA AAAAATAT 


3516 


5265 


UUUUUGUC A UGAACUGU 


1189 


ACAGTTCA GGCTAGCTACAACGA GACAAAAA 


3517 


5269 


UGUCAUGA A CUGUACUA 


1190 


TAGTACAG GGCTAGCTACAACGA TCATGACA 


3518 


5272 


CAUGAACU G UACUACUC 


1191 


GAGTAGTA GGCTAGCTACAACGA AGTTCATG 


3519 


5274 


UGAACUGU A CUACUCCU 


1192 


AGGAGTAG GGCTAGCTACAACGA ACAGTTCA 


3520 
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5277 


ACUGUACU A CUCCUAAU 


1193 


ATTAGGAG GGCTAGCTACAACGA AGTACAGT 


3521 


5284 


UACUCCUA A UUAUUGUA 


1194 


TACAATAA GGCTAGCTACAACGA TAGGAGTA 


3522 


5287 


UCCUAAUU A UUGUAAUG 


1195 


CATTACAA GGCTAGCTACAACGA AATTAGGA 


3523 


5290 


UAAUUAUU G UAAUGUAA 


1196 


TTACATTA GGCTAGCTACAACGA AATAATTA 


3524 


5293 


UUAUUGUA A UGUAAUAA 


1197 


TTATTACA GGCTAGCTACAACGA TACAATAA 


3525 


5295 


AUUGUAAU G UAAUAAAA 


1198 


TTTTATTA GGCTAGCTACAACGA ATTACAAT 


3526 


5298 


GUAAUGUA A UAAAAAUA 


1199 


TATTTTTA GGCTAGCTACAACGA TACATTAC 


3527 


5304 


UAAUAAAA A UAGUUACA 


1200 


TGTAACTA GGCTAGCTACAACGA TTTTATTA 


3528 


5307 


UAAAAAUA G UUACAGUG 


1201 


CACTGTAA GGCTAGCTACAACGA TATTTTTA 


3529 


5310 


AAAUAGUU A CAGUGACU 


1202 


AGTCACTG GGCTAGCTACAACGA AACTATTT 


3530 


5313 


UAGUUACA G UGACUAUG 


1203 


CATAGTCA GGCTAGCTACAACGA TGTAACTA 


3531 


5316 


UUACAGUG A CUAUGAGU 


1204 


ACTCATAG GGCTAGCTACAACGA CACTGTAA 


3532 


5319 


CAGUGACU A UGAGUGUG 


1205 


CACACTCA GGCTAGCTACAACGA AGTCACTG 


3533 


5323 


GACUAUGA G UGUGUAUU 


1206 


AATACACA GGCTAGCTACAACGA TCATAGTC 


3534 


5325 


CUAUGAGU G UGUAUUUA 


1207 


TAAATACA GGCTAGCTACAACGA ACTCATAG 


3535 


5327 


AUGAGUGU G UAUUUAUU 


1208 


AATAAATA GGCTAGCTACAACGA ACACTCAT 


3536 


5329 


GAGUGUGU A UUUAUUCA 


1209 


TGAATAAA GGCTAGCTACAACGA ACACACTC 


3537 


5333 


GUGUAUUU A UUCAUGCA 


1210 


TGCATGAA GGCTAGCTACAACGA AAATACAC 


3538 


5337 


AUUUAUUC A UGCAAAUU 


1211 


AATTTGCA GGCTAGCTACAACGA GAATAAAT 


3539 


5339 


UUAUUCAU G CAAAUUUG 


1212 


CAAATTTG GGCTAGCTACAACGA ATGAATAA 


3540 


5343 


UCAUGCAA A UUUGAACU 


1213 


AGTTCAAA GGCTAGCTACAACGA TTGCATGA 


3541 


5349 


AAAUUUGA A CUGUUUGC 


1214 


GCAAACAG GGCTAGCTACAACGA TCAAATTT 


3542 


5352 


UUUGAACU G UUUGCCCC 


1215 


GGGGCAAA GGCTAGCTACAACGA AGTTCAAA 


3543 


5356 


AACUGUUU G CCCCGAAA 


1216 


TTTCGGGG GGCTAGCTACAACGA AAACAGTT 


3544 


5364 


GCCCCGAA A UGGAUAUG 


1217 


CATATCCA GGCTAGCTACAACGA TTCGGGGC 


3545 


5368 


CGAAAUGG A UAUGGAUA 


1218 


TATCCATA GGCTAGCTACAACGA CCATTTCG 


3546 


5370 


AAAUGGAU A UGGAUACU 


1219 


AGTATCCA GGCTAGCTACAACGA ATCCATTT 


3547 


5374 


GGAUAUGG A UACUUUAU 


1220 


ATAAAGTA GGCTAGCTACAACGA CCATATCC 


3548 


5376 


AUAUGGAU A CUUUAUAA 


1221 


TTATAAAG GGCTAGCTACAACGA ATCCATAT 


3549 


5381 


GAUACUUU A UAAGCCAU 


1222 


ATGGCTTA GGCTAGCTACAACGA AAAGTATC 


3550 


5385 


CUUUAUAA G CCAUAGAC 


1223 


GTCTATGG GGCTAGCTACAACGA TTATAAAG 


3551 


5388 


UAUAAGCC A UAGACACU 


1224 


^ AGTGTCTA GGCTAGCTACAACGA GGCTTATA 


3552 


5392 


AGCCAUAG A CACUAUAG 


1225 


CTATAGTG GGCTAGCTACAACGA CTATGGCT 


3553 


5394 


CCAUAGAC A CUAUAGUA 


1226 


TACTATAG GGCTAGCTACAACGA GTCTATGG 


3554 


5397 


UAGACACU A UAGUAUAC 


1227 


GTATACTA GGCTAGCTACAACGA AGTGTCTA 


3555 


5400 


ACACUAUA G UAUACCAG 


1228 


CTGGTATA GGCTAGCTACAACGA TATAGTGT 


3556 


5402 


ACUAUAGU A UACCAGUG 


1229 


CACTGGTA GGCTAGCTACAACGA ACTATAGT 


3557 


5404 


UAUAGUAU A CCAGUGAA 


1230 


TTCACTGG GGCTAGCTACAACGA ATACTATA 


3558 


5408 


GUAUACCA G UGAAUCUU 


1231 


AAGATTCA GGCTAGCTACAACGA TGGTATAC 


3559 


5412 


ACCAGUGA A UCUUUUAU 


1232 


ATAAAAGA GGCTAGCTACAACGA TCACTGGT 


3560 


5419 


AAUCUUUU A UGCAGCUU 


1233 


AAGCTGCA GGCTAGCTACAACGA AAAAGATT 


3561 


5421 


UCUUUUAU G CAGCUUGU 


1234 


ACAAGCTG GGCTAGCTACAACGA ATAAAAGA 


3562 


5424 


UUUAUGCA G CUUGUUAG 


1235 


CTAACAAG GGCTAGCTACAACGA TGCATAAA 


3563 


5428 


UGCAGCUU G UUAGAAGU 


1236 


ACTTCTAA GGCTAGCTACAACGA AAGCTGCA 


3564 


5435 


UGUUAGAA G UAUCCUUU 


1237 


AAAGGATA GGCTAGCTACAACGA TTCTAACA 


3565 


5437 


UUAGAAGU A UCCUUUUA 


1238 


TAAAAGGA GGCTAGCTACAACGA ACTTCTAA 


3566 


5445 


AUCCUUUU A UUUUCUAA 


1239 


TTAGAAAA GGCTAGCTACAACGA AAAAGGAT 


3567 


5457 


UCUAAAAG G UGCUGUGG 


1240 


CCACAGCA GGCTAGCTACAACGA CTTTTAGA 


3568 


5459 


UAAAAGGU G CUGUGGAU 


1241 


ATCCACAG GGCTAGCTACAACGA ACCTTTTA 


3569 


5462 


AAGGUGCU G UGGAUAUU 


1242 


AATATCCA GGCTAGCTACAACGA AGCACCTT 


3570 


5466 


UGCUGUGG A UAUUAUGU 


1243 


ACATAATA GGCTAGCTACAACGA CCACAGCA 


3571 


5468 


CUGUGGAU A UUAUGUAA 


1244 


TTACATAA GGCTAGCTACAACGA ATCCACAG 


3572 
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5471 


UGGAUAUU A UGUAAAGG 


1245 


CCTTTACA GG CTAGCT ACAACGA AATATCCA 


3573 


5473 


GAUAUUAU G UAAAGGCG 


1246 


CGCCTTTA GGCTAGCTACAACGA ATAATATC 


3574 


5479 


AUGUAAAG G CGUGUUUG 


1247 


CAAACACG GGCTAGCTACAACGA CTTTACAT 


3575 


5481 


GUAAAGGC G UGUUUGCU 


1248 


AGCAAACA GGCTAGCTACAACGA GCCTTTAC 


3576 


5483 


AAAGGCGU G UUUGCUUA 


1249 


TAAGCAAA GGCTAGCTACAACGA ACGCCTTT 


3577 


5487 


GCGUGUUU G CUUAAACA 


1250 


TGTTTAAG GGCTAGCTACAACGA AAACACGC 


3578 


5493 


UUGCUUAA A CAAUUUUC 


1251 


GAAAATTG GGCTAGCTACAACGA TTAAGCAA 


3579 


5496 


CUUAAACA A UUUUCCAU 


1252 


ATGGAAAA GGCTAGCTACAACGA TGTTTAAG 


3580 


5503 


AAUUUUCC A UAUUUAGA 


1253 


TCTAAATA GGCTAGCTACAACGA GGAAAATT 


3581 


5505 


UUUUCCAU A UUUAGAAG 


1254 


CTTCTAAA GGCTAGCTACAACGA ATGGAAAA 


3582 


5513 


AUUUAGAA G UAGAUGCA 


1255 


TGCATCTA GGCTAGCTACAACGA TTCTAAAT 


3583 


5517 


AGAAGUAG A UGCAAAAC 


1256 


GTTTTGCA GGCTAGCTACAACGA CTACTTCT 


3584 


5519 


AAGUAGAU G CAAAACAA 


1257 


TTGTTTTG GGCTAGCTACAACGA ATCTACTT 


3585 


5524 


GAUGCAAA A CAAAUCUG 


1258 


CAGATTTG GGCTAGCTACAACGA TTTGCATC 


3586 


5528 


CAAAACAA A UCUGCCUU 


1259 


AAGGCAGA GGCTAGCTACAACGA TTGTTTTG 


3587 


5532 


ACAAAUCU G CCUUUAUG 


1260 


CATAAAGG GGCTAGCTACAACGA AGATTTGT 


3588 


5538 


CUGCCUUU A UGACAAAA 


1261 


TTTTGTCA GGCTAGCTACAACGA AAAGGCAG 


3589 


5541 


CCUUUAUG A CAAAAAAA 


1262 


TTTTTTTG GGCTAGCTACAACGA CATAAAGG 


3590 


5549 


ACAAAAAA A UAGGAUAA 


1263 


TTATCCTA GGCTAGCTACAACGA TTTTTTGT 


3591 


5554 


AAAAUAGG A UAACAUUA 


1264 


TAATGTTA GGCTAGCTACAACGA CCTATTTT 


3592 


5557 


AUAGGAUA A CAUUAUUU 


1265 


AAATAATG GGCTAGCTACAACGA TATCCTAT 


3593 


5559 


AGGAUAAC A UUAUUUAU 


1266 


ATAAATAA GGCTAGCTACAACGA GTTATCCT 


3594 


5562 


AUAACAUU A UUUAUUUA 


1267 


TAAATAAA GGCTAGCTACAACGA AATGTTAT 


3595 


5566 


CAUUAUUU A UUUAUUUC 


1268 


GAAATAAA GGCTAGCTACAACGA AAATAATG 


3596 


5570 


AUUUAUUU A UUUCCUUU 


1269 


AAAGGAAA GGCTAGCTACAACGA AAATAAAT 


3597 


5580 


UUCCUUUU A UCAAUAAG 


1270 


CTTATTGA GGCTAGCTACAACGA AAAAGGAA 


359B 


5584 


UUUUAUCA A UAAGGUAA 


1271 


TTACCTTA GGCTAGCTACAACGA TGATAAAA 


3599 


5589 


UCAAUAAG G UAAUUGAU 


1272 


ATCAATTA GGCTAGCTACAACGA CTTATTGA 


3600 


5592 


AUAAGGUA A UUGAUACA 


1273 


TGTATCAA GGCTAGCTACAACGA TACCTTAT 


3601 


5596 


GGUAAUUG A UACACAAC 


1274 


GTTGTGTA GGCTAGCTACAACGA CAATTACC 


3602 


5598 


UAAUUGAU A CACAACAG 


1275 


CTGTTGTG GGCTAGCTACAACGA ATCAATTA 


3603 


5600 


AUUGAUAC A CAACAGGU 


1276 


ACCTGTTG GGCTAGCTACAACGA GTATCAAT 


3604 


5603 


GAUACACA A CAGGUGAC 


1277 


GTCACCTG GGCTAGCTACAACGA TGTGTATC 


3605 


5607 


CACAACAG G UGACUUGG 


1278 


CCAAGTCA GGCTAGCTACAACGA CTGTTGTG 


3606 


5610 


AACAGGUG A CUUGGUUU 


1279 


AAACCAAG GGCTAGCTACAACGA CACCTGTT 


3607 


5615 


GUGACUUG G UUUUAGGC 


1280 


GCCTAAAA GGCTAGCTACAACGA CAAGTCAC 


3608 


5622 


GGUUUUAG G CCCAAAGG 


1281 


CCTTTGGG GGCTAGCTACAACGA CTAAAACC 


3609 


5630 


GCCCAAAG G UAGCAGCA 


1282 


TGCTGCTA GGCTAGCTACAACGA CTTTGGGC 


3610 


5633 


CAAAGGUA G CAGCAGCA 


1283 


TGCTGCTG GGCTAGCTACAACGA TACCTTTG 


3611 


5636 


AGGUAGCA G CAGCAACA 


1284 


TGTTGCTG GGCTAGCTACAACGA TGCTACCT 


3612 


5639 


UAGCAGCA G CAACAUUA 


1285 


TAATGTTG GGCTAGCTACAACGA TGCTGCTA 


3613 


5642 


CAGCAGCA A CAUUAAUA 


1286 


TATTAATG GGCTAGCTACAACGA TGCTGCTG 


3614 


5644 


GCAGCAAC A UUAAUAAU 


1287 


ATTATTAA GGCTAGCTACAACGA GTTGCTGC 


3615 


5648 


CAACAUUA A UAAUGGAA 


1288 


TTCCATTA GGCTAGCTACAACGA TAATGTTG 


3616 


5651 


CAUUAAUA A UGGAAAUA 


1289 


TATTTCCA GGCTAGCTACAACGA TATTAATG 


3617 


5657 


UAAUGGAA A UAAUUGAA 


1290 


TTCAATTA GGCTAGCTACAACGA TTCCATTA 


3618 


5660 


UGGAAAUA A UUGAAUAG 


1291 


CTATTCAA GGCTAGCTACAACGA TATTTCCA 


3619 


5665 


AUAAUUGA A UAGUUAGU 


1292 


ACTAACTA GGCTAGCTACAACGA TCAATTAT 


3620 


5668 


AUUGAAUA G UUAGUUAU 


1293 


ATAACTAA GGCTAGCTACAACGA TATTCAAT 


3621 


5672 


AAUAGUUA G UUAUGUAU 


1294 


ATACATAA GGCTAGCTACAACGA TAACTATT 


3622 


5675 


AGUUAGUU A UGUAUGUU 


1295 


AACATACA GGCTAGCTACAACGA AACTAACT 


3623 


5677 


UUAGUUAU G UAUGUUAA 


1296 


TTAACATA GGCTAGCTACAACGA ATAACTAA 


3624 
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5679 


AGUUAUGU A UGUUAAUG 


1297 


CATTAACA GGCTAGCTACAACGA ACATAACT 


3625 


5681 


UUAUGUAU G UUAAUGCC 


1298 


GGCATTAA GGCTAGCTACAACGA ATACATAA 


3626 


5685 


GUAUGUUA A UGCCAGUC 


1299 


GACTGGCA GGCTAGCTACAACGA TAACATAC 


3627 


5687 


AUGU0AAU G CCAGUCAC 


1300 


GTGACTGG GGCTAGCTACAACGA ATTAACAT 


3628 


5691 


UAAUGCCA G UCACCAGC 


1301 


GCTGGTGA GGCTAGCTACAACGA TGGCATTA 


3629 


5694 


UGCCAGUC A CCAGCAGG 


1302 


CCTGCTGG GGCTAGCTACAACGA GACTGGCA 


3630 


5698 


AGUCACCA G CAGGCUAU 


1303 


ATAGCCTG GGCTAGCTACAACGA TGGTGACT 


3631 


5702 


ACCAGCAG G CUAUUUCA 


1304 


TGAAATAG GGCTAGCTACAACGA CTGCTGGT 


3632 


5705 


AGCAGGCU A UUUCAAGG 


1305 


CCTTGAAA GGCTAGCTACAACGA AGCCTGCT 


3633 


5713 


AUUUCAAG G UCAGAAGU 


1306 


ACTTCTGA GGCTAGCTACAACGA CTTGAAAT 


3634 


5720 


GGUCAGAA G UAAUGACU 


1307 


AGTCATTA GGCTAGCTACAACGA TTCTGACC 


3635 


5723 


CAGAAGUA A UGACUCCA 


1308 


TGGAGTCA GGCTAGCTACAACGA TACTTCTG 


3636 


5726 


AAGUAAUG A CUCCAUAC 


1309 


GTATGGAG GGCTAGCTACAACGA CATTACTT 


3637 


5731 


AUGACUCC A UACAUAUU 


1310 


AATATGTA GGCTAGCTACAACGA GGAGTCAT 


3638 


5733 


GACUCCAU A CAUAUUAU 


1311 


ATAATATG GGCTAGCTACAACGA ATGGAGTC 


3639 


5735 


CUCCAUAC A UAUUAUUU 


1312 


AAATAATA GGCTAGCTACAACGA GTATGGAG 


3640 


5737 


CCAUACAU A UUAUUUAU 


1313 


ATAAATAA GGCTAGCTACAACGA ATGTATGG 


3641 


5740 


UACAUAUU A UUUAUUUC 


1314 


GAAATAAA GGCTAGCTACAACGA AATATGTA 


3642 


5744 


UAUUAUUU A UUUCUAUA 


1315 


TATAGAAA GGCTAGCTACAACGA AAATAATA 


3643 


5750 


UUAUUUCU A UAACUACA 


1316 


TGTAGTTA GGCTAGCTACAACGA AGAAATAA 


3644 


5753 


UUUCUAUA A CUACAUUU 


1317 


AAATGTAG GGCTAGCTACAACGA TATAGAAA 


3645 


5756 


CUAUAACU A CAUUUAAA 


1318 


TTTAAATG GGCTAGCTACAACGA AGTTATAG 


3646 


5758 


AUAACUAC A UUUAAAUC 


1319 


GATTTAAA GGCTAGCTACAACGA GTAGTTAT 


3647 


5764 


ACAUUUAA A UCAUUACC 


1320 


GGTAATGA GGCTAGCTACAACGA TTAAATGT 


3648 


5767 


UUUAAAUC A UUACCAGG 


1321 


CCTGGTAA GGCTAGCTACAACGA GATTTAAA 


3649 



Input Sequence = NM_004985. Cut Site = R/Y 

Arm Length = 8 . Core Sequence - GGCTAGCTACAACGA 

NM_004985 {Homo sapiens v-Ki-ras2 Kirsten rat sarcoma 2 viral oncogene homolog 
(KRas2), mRNA; 5775 nt) 
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Table HI: Human H-Ras DNAzyme and Target molecules 



Pos 


Substrate 


Seq 
ID 


DNAzyme 


Seq 
ID 


9 


GGAUCCCA G CCUUUCCC 


1322 


GGGAAAGG GGCTAGCTACAACGA TGGGATCC 


3650 


20 


UUUCCCCA G CCCGUAGC 


1323 


GCTACGGG GGCTAGCTACAACGA TGGGGAAA 


3651 


24 


CCCAGCCC G UAGCCCCG 


1324 


CGGGGCTA GGCTAGCTACAACGA GGGCTGGG 


3652 


27 


AGCCCGUA G CCCCGGGA 


1325 


TCCCGGGG GGCTAGCTACAACGA TACGGGCT 


3653 


35 


GCCCCGGG A CCUCCGCG 


1326 


CGCGGAGG GGCTAGCTACAACGA CCCGGGGC 


3654 


41 


GGACCUCC G CGGUGGGC 


1327 


GCCCACCG GGCTAGCTACAACGA GGAGGTCC 


3655 


44 


CCUCCGCG G UGGGCGGC 


1328 


GCCGCCCA GGCTAGCTACAACGA CGCGGAGG 


3656 


48 


CGCGGUGG G CGGCGCCG 


1329 


CGGCGCCG GGCTAGCTACAACGA CCACCGCG 


3657 


51 


GGUGGGCG G CGCCGCGC 


1330 


GCGCGGCG GGCTAGCTACAACGA CGCCCACC 


3658 


53 


UGGGCGGC G CCGCGCUG 


1331 


CAGCGCGG GGCTAGCTACAACGA GCCGCCCA 


3659 


56 


GCGGCGCC G CGCUGCCG 


1332 


CGGCAGCG GGCTAGCTACAACGA GGCGCCGC 


3660 


58 


GGCGCCGC G CUGCCGGC 


1333 


GCCGGCAG GGCTAGCTACAACGA GCGGCGCC 


3661 


61 


GCCGCGCU G CCGGCGCA 


1334 


TGCGCCGG GGCTAGCTACAACGA AGCGCGGC 


3662 


65 


CGCUGCCG G CGCAGGGA 


1335 


TCCCTGCG GGCTAGCTACAACGA CGGCAGCG 


3663 


67 


CUGCCGGC G CAGGGAGG 


1336 


CCTCCCTG GGCTAGCTACAACGA GCCGGCAG 


3664 


76 


CAGGGAGG G CCUCUGGU 


1337 


ACCAGAGG GGCTAGCTACAACGA CCTCCCTG 


3665 


83 


GGCCUCUG G UGCACCGG 


1338 


CCGGTGCA GGCTAGCTACAACGA CAGAGGCC 


3666 


85 


CCUCUGGU G CACCGGCA 


1339 


TGCCGGTG GGCTAGCTACAACGA ACCAGAGG 


3667 


87 


UCUGGUGC A CCGGCACC 


1340 


GGTGCCGG GGCTAGCTACAACGA GCACCAGA 


3668 


91 


GUGCACCG G CACCGCUG 


1341 


CAGCGGTG GGCTAGCTACAACGA CGGTGCAC 


3669 


93 


GCACCGGC A CCGCUGAG 


1342 


CTCAGCGG GGCTAGCTACAACGA GCCGGTGC 


3670 


96 


CCGGCACC G CUGAGUCG 


1343 


CGACTCAG GGCTAGCTACAACGA GGTGCCGG 


3671 


101 


ACCGCUGA G UCGGGUUC 


1344 


GAACCCGA GGCTAGCTACAACGA TCAGCGGT 


3672 


106 


UGAGUCGG G UUCUCUCG 


1345 


CGAGAGAA GGCTAGCTACAACGA CCGACTCA 


3673 


114 


GUUCUCUC G CCGGCCUG 


1346 


CAGGCCGG GGCTAGCTACAACGA GAGAGAAC 


3674 


118 


UCUCGCCG G CCUGUUCC 


1347 


GGAACAGG GGCTAGCTACAACGA CGGCGAGA 


3675 


122 


GCCGGCCU G UUCCCGGG 


1348 


CCCGGGAA GGCTAGCTACAACGA AGGCCGGC 


3676 


134 


CCGGGAGA G CCCGGGGC 


1349 


GCCCCGGG GGCTAGCTACAACGA TCTCCCGG 


3677 


141 


AGCCCGGG G CCCUGCUC 


1350 


GAGCAGGG GGCTAGCTACAACGA CCCGGGCT 


3678 


146 


GGGGCCCU G CUCGGAGA 


1351 


TCTCCGAG GGCTAGCTACAACGA AGGGCCCC 


3679 


154 


GCUCGGAG A UGCCGCCC 


1352 


GGGCGGCA GGCTAGCTACAACGA CTCCGAGC 


3680 


156 


UCGGAGAU G CCGCCCCG 


1353 


CGGGGCGG GGCTAGCTACAACGA ATCTCCGA 


3681 


159 


GAGAUGCC G CCCCGGGC 


1354 


GCCCGGGG GGCTAGCTACAACGA GGCATCTC 


3682 


166 


CGCCCCGG G CCCCCAGA 


1355 


TCTGGGGG GGCTAGCTACAACGA CCGGGGCG 


3683 


174 


GCCCCCAG A CACCGGCU 


1356 


AGCCGGTG GGCTAGCTACAACGA CTGGGGGC 


3684 


176 


CCCCAGAC A CCGGCUCC 


1357 


GGAGCCGG GGCTAGCTACAACGA GTCTGGGG 


3685 


180 


AGACACCG G CUCCCUGG 


1358 


CCAGGGAG GGCTAGCTACAACGA CGGTGTCT 


3686 


188 


GCUCCCUG G CCUUCCUC 


1359 


GAGGAAGG GGCTAGCTACAACGA CAGGGAGC 


3687 


199 


UUCCUCGA G CAACCCCG 


1360 


CGGGGTTG GGCTAGCTACAACGA TCGAGGAA 


3688 


202 


CUCGAGCA A CCCCGAGC 


1361 


GCTCGGGG GGCTAGCTACAACGA TGCTCGAG 


3689 


209 


AACCCCGA G CUCGGCUC 


1362 


GAGCCGAG GGCTAGCTACAACGA TCGGGGTT 


3690 


214 


CGAGCUCG G CUCCGGUC 


1363 


GACCGGAG GGCTAGCTACAACGA CGAGCTCG 


3691 


220 


CGGCUCCG G UCUCCAGC 


1364 


GCTGGAGA GGCTAGCTACAACGA CGGAGCCG 


3692 


227 


GGUCUCCA G CCAAGCCC 


1365 


GGGCTTGG GGCTAGCTACAACGA TGGAGACC 


3693 


232 


CCAGCCAA G CCCAACCC 


1366 


GGGTTGGG GGCTAGCTACAACGA TTGGCTGG 


3694 


237 


CAAGCCCA A CCCCGAGA 


1367 


TCTCGGGG GGCTAGCTACAACGA TGGGCTTG 


3695 


247 


. CCCGAGAG G CCGCGGCC 


1368 


GGCCGCGG GGCTAGCTACAACGA CTCTCGGG 


3696 


250 


GAGAGGCC G CGGCCCUA 


1369 


TAGGGCCG GGCTAGCTACAACGA GGCCTCTC 


3697 
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253 


AGGCCGCG G CCCUACUG 


1370 


CAGTAGGG GGCTAGCTACAACGA CGCGGCCT 


3698 


258 


GCGGCCCU A CUGGCUCC 


1371 


GGAGCCAG GGCTAGCTACAACGA AGGGCCGC 


3699 


262 


CCCUACUG G CUCCGCCU 


1372 


AGGCGGAG GGCTAGCTACAACGA CAGTAGGG 


3700 


267 


CUGGCUCC G CCUCCCGC 


1373 


GCGGGAGG GGCTAGCTACAACGA GGAGCCAG 


3701 


274 


CGCCUCCC G CGUUGCUC 


1374 


GAGCAACG GGCTAGCTACAACGA GGGAGGCG 


3702 


276 


CCUCCCGC G UUGCUCCC 


1375 


GGGAGCAA GGCTAGCTACAACGA GCGGGAGG 


3703 


279 


CCCGCGUU G CUCCCGGA 


1376 


TCCGGGAG GGCTAGCTACAACGA AACGCGGG 


3704 


289 


UCCCGGAA G CCCCGCCC 


1377 


GGGCGGGG GGCTAGCTACAACGA TTCCGGGA 


3705 


294 


GAAGCCCC G CCCGACCG 


1378 


CGGTCGGG GGCTAGCTACAACGA GGGGCTTC 


3706 


299 


CCCGCCCG A CCGCGGCU 


1379 


AGCCGCGG GGCTAGCTACAACGA CGGGCGGG 


3707 


302 


GCCCGACC G CGGCUCCU 


1380 


AGGAGCCG GGCTAGCTACAACGA GGTCGGGC 


3708 


305 


CGACCGCG G CUCCUGAC 


1381 


GTCAGGAG GGCTAGCTACAACGA CGCGGTCG 


3709 


312 


GGCUCCUG A CAGACGGG 


1382 


CCCGTCTG GGCTAGCTACAACGA CAGGAGCC 


3710 


316 


CCUGACAG A CGGGCCGC 


1383 


GCGGCCCG GGCTAGCTACAACGA CTGTCAGG 


3711 


320 


ACAGACGG G CCGCUCAG 


1384 


CTGAGCGG GGCTAGCTACAACGA CCGTCTGT 


3712 


323 


GACGGGCC G CUCAGCCA 


1385 


TGGCTGAG GGCTAGCTACAACGA GGCCCGTC 


3713 


328 


GCCGCUCA G CCAACCGG 


1386 


CCGGTTGG GGCTAGCTACAACGA TGAGCGGC 


3714 


332 


CUCAGCCA A CCGGGGUG 


1387 


CACCCCGG GGCTAGCTACAACGA TGGCTGAG 


3715 


338 


CAACCGGG G UGGGGCGG 


1388 


CCGCCCCA GGCTAGCTACAACGA CCCGGTTG 


3716 


343 


GGGGUGGG G CGGGGCCC 


1389 


GGGCCCCG GGCTAGCTACAACGA CCCACCCC 


3717 


348 


GGGGCGGG G CCCGAUGG 


1390 


CCATCGGG GGCTAGCTACAACGA CCCGCCCC 


3718 


353 


GGGGCCCG A UGGCGCGC 


1391 


GCGCGCCA GGCTAGCTACAACGA CGGGCCCC 


3719 


356 


GCCCGAUG G CGCGCAGC 


1392 


GCTGCGCG GGCTAGCTACAACGA CATCGGGC 


3720 


358 


CCGAUGGC G CGCAGCCA 


1393 


TGGCTGCG GGCTAGCTACAACGA GCCATCGG 


3721 


360 


GAUGGCGC G CAGCCAAU 


1394 


ATTGGCTG GGCTAGCTACAACGA GCGCCATC 


3722 


363 


GGCGCGCA G CCAAUGGU 


1395 


ACCATTGG GGCTAGCTACAACGA TGCGCGCC 


3723 


367 


CGCAGCCA A UGGUAGGC 


1396 


GCCTACCA GGCTAGCTACAACGA TGGCTGCG 


3724 


370 


AGCCAAUG G UAGGCCGC 


1397 


GCGGCCTA GGCTAGCTACAACGA CATTGGCT 


3725 


374 


AAUGGUAG G CCGCGCCU 


1398 


AGGCGCGG GGCTAGCTACAACGA CTACCATT 


3726 


377 


GGUAGGCC G CGCCUGGC 


1399 


GCCAGGCG GGCTAGCTACAACGA GGCCTACC 


3727 


379 


UAGGCCGC G CCUGGCAG 


1400 


CTGCCAGG GGCTAGCTACAACGA GCGGCCTA 


3728 


384 


CGCGCCUG G CAGACGGA 


1401 


TCCGTCTG GGCTAGCTACAACGA CAGGCGCG 


3729 


388 


CCUGGCAG A CGGACGGG 


1402 


CCCGTCCG GGCTAGCTACAACGA CTGCCAGG 


3730 


3 92 


GCAGACGG A CGGGCGCG 


1403 


CGCGCCCG GGCTAGCTACAACGA CCGTCTGC 


3731 


396 


ACGGACGG G CGCGGGGC 


1404 


GCCCCGCG GGCTAGCTACAACGA CCGTCCGT 


3732 


398 


GGACGGGC G CGGGGCGG 


1405 


CCGCCCCG GGCTAGCTACAACGA GCCCGTCC 


3733 


403 


GGCGCGGG G CGGGGCGU 


1406 


ACGCCCCG GGCTAGCTACAACGA CCCGCGCC 


3734 


408 


GGGGCGGG G CGUGCGCA 


1407 


TGCGCACG GGCTAGCTACAACGA CCCGCCCC 


3735 


410 


GGCGGGGC G UGCGCAGG 


1408 


CCTGCGCA GGCTAGCTACAACGA GCCCCGCC 


3736 


412 


CGGGGCGU G CGCAGGCC 


1409 


GGCCTGCG GGCTAGCTACAACGA ACGCCCCG 


3737 


414 


GGGCGUGC G CAGGCCCG 


1410 


CGGGCCTG GGCTAGCTACAACGA GCACGCCC 


3738 


418 


GUGCGCAG G CCCGCCCG 


1411 


CGGGCGGG GGCTAGCTACAACGA CTGCGCAC 


3739 


422 


GCAGGCCC G CCCGAGUC 


1412 


GACTCGGG GGCTAGCTACAACGA GGGCCTGC 


3740 


428 


CCGCCCGA G UCUCCGCC 


1413 


GGCGGAGA GGCTAGCTACAACGA TCGGGCGG 


3741 


434 


GAGUCUCC G CCGCCCGU 


1414 


ACGGGCGG GGCTAGCTACAACGA GGAGACTC 


3742 


437 


UCUCCGCC G CCCGUGCC 


1415 


GGCACGGG GGCTAGCTACAACGA GGCGGAGA 


3743 


441 


CGCCGCCC G UGCCCUGC 


1416 


GCAGGGCA GGCTAGCTACAACGA GGGCGGCG 


3744 


443 


CCGCCCGU G CCCUGCGC 


1417 


GCGCAGGG GGCTAGCTACAACGA ACGGGCGG 


3745 


448 


CGUGCCCU G CGCCCGCA 


1418 


TGCGGGCG GGCTAGCTACAACGA AGGGCACG 


3746 


450 


UGCCCUGC G CCCGCAAC 


1419 


GTTGCGGG GGCTAGCTACAACGA GCAGGGCA 


3747 


454 


CUGCGCCC G CAACCCGA 


1420 


TCGGGTTG GGCTAGCTACAACGA GGGCGCAG 


3748 


457 


CGCCCGCA A CCCGAGCC 


1421 


GGCTCGGG GGCTAGCTACAACGA TGCGGGCG 


3749 
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463 


CAACCCGA G CCGCACCC 


1422 


GGGTGCGG GGCTAGCTACAACGA TCGGGTTG 


3750 


466 


CCCGAGCC G CACCCGCC 


1423 


GGCGGGTG GGCTAGCTACAACGA GGCTCGGG 


3751 


468 


CGAGCCGC A CCCGCCGC 


1424 


GCGGCGGG GGCTAGCTACAACGA GCGGCTCG 


3752 


472 


CCGCACCC G CCGCGGAC 


1425 


GTCCGCGG GGCTAGCTACAACGA GGGTGCGG 


3753 


475 


CACCCGCC G CGGACGGA 


1426 


TCCGTCCG GGCTAGCTACAACGA GGCGGGTG 


3754 


479 


CGCCGCGG A CGGAGCCC 


1427 


GGGCTCCG GGCTAGCTACAACGA CCGCGGCG 


3755 


484 


CGGACGGA G CCCAUGCG 


1428 


CGCATGGG GGCTAGCTACAACGA TCCGTCCG 


3756 


488 


CGGAGCCC A UGCGCGGG 


1429 


CCCGCGCA GGCTAGCTACAACGA GGGCTCCG 


3757 


490 


GAGCCCAU G CGCGGGGC 


1430 


GCCCCGCG GGCTAGCTACAACGA ATGGGCTC 


3758 


492 


GCCCAUGC G CGGGGCGA 


1431 


TCGCCCCG GGCTAGCTACAACGA GCATGGGC 


3759 


497 


UGCGCGGG G CGAACCGC 


1432 


GCGGTTCG GGCTAGCTACAACGA CCCGCGCA 


3760 


501 


CGGGGCGA A CCGCGCGC 


1433 


GCGCGCGG GGCTAGCTACAACGA TCGCCCCG 


3761 


504 


GGCGAACC G CGCGCCCC 


1434 


GGGGCGCG GGCTAGCTACAACGA GGTTCGCC 


3762 


506 


CGAACCGC G CGCCCCCG 


1435 


CGGGGGCG GGCTAGCTACAACGA GCGGTTCG 


3763 


508 


AACCGCGC G CCCCCGCC 


1436 


GGCGGGGG GGCTAGCTACAACGA GCGCGGTT 


3764 


514 


GCGCCCCC G CCCCCGCC 


1437 


GGCGGGGG GGCTAGCTACAACGA GGGGGCGC 


3765 


520 


CCGCCCCC G CCCCGCCC 


1438 


GGGCGGGG GGCTAGCTACAACGA GGGGGCGG 


3766 


525 


CCCGCCCC G CCCCGGCC 


1439 


GGCCGGGG GGCTAGCTACAACGA GGGGCGGG 


3767 


531 


CCGCCCCG G CCUCGGCC 


1440 


GGCCGAGG GGCTAGCTACAACGA CGGGGCGG 


3768 


537 


CGGCCUCG G CCCCGGCC 


1441 


GGCCGGGG GGCTAGCTACAACGA CGAGGCCG 


3769 


543 


CGGCCCCG G CCCUGGCC 


1442 


GGCCAGGG GGCTAGCTACAACGA CGGGGCCG 


3770 


549 


CGGCCCVG G CCCCGGGG 


1443 


CCCCGGGG GGCTAGCTACAACGA CAGGGCCG 


3771 


558 


CCCCGGGG G CAGUCGCG 


1444 


CGCGACTG GGCTAGCTACAACGA CCCCGGGG 


3772 


561 


CGGGGGCA G UCGCGCCU 


1445 


AGGCGCGA GGCTAGCTACAACGA TGCCCCCG 


3773 


564 


GGGCAGUC G CGCCUGUG 


1446 


CACAGGCG GGCTAGCTACAACGA GACTGCCC 


3774 


566 


GCAGUCGC G CCUGUGAA 


1447 


TTCACAGG GGCTAGCTACAACGA GCGACTGC 


3775 


570 


UCGCGCCU G UGAACGGU 


1448 


ACCGTTCA GGCTAGCTACAACGA AGGCGCGA 


3776 


574 


GCCUGUGA A CGGUGAGU 


1449 


ACTCACCG GGCTAGCTACAACGA TCACAGGC 


3777 


577 


UGUGAACG G UGAGUGCG 


1450 


CGCACTCA GGCTAGCTACAACGA CGTTCACA 


3778 


581 


AACGGUGA G UGCGGGCA 


1451 


TGCCCGCA GGCTAGCTACAACGA TCACCGTT 


3779 


583 


CGGUGAGU G CGGGCAGG 


1452 


CCTGCCCG GGCTAGCTACAACGA ACTCACCG 


3780 


587 


GAGUGCGG G CAGGGAUC 


1453 


GATCCCTG GGCTAGCTACAACGA CCGCACTC 


3781 


593 


GGGCAGGG A UCGGCCGG 


1454 


CCGGCCGA GGCTAGCTACAACGA CCCTGCCC 


3782 


597 


AGGGAUCG G CCGGGCCG 


1455 


CGGCCCGG GGCTAGCTACAACGA CGATCCCT 


3783 


602 


UCGGCCGG G CCGCGCGC 


1456 


GCGCGCGG GGCTAGCTACAACGA CCGGCCGA 


3784 


605 


GCCGGGCC G CGCGCCCU 


1457 


AGGGCGCG GGCTAGCTACAACGA GGCCCGGC 


3785 


607 


CGGGCCGC G CGCCCUCC 


1458 


GGAGGGCG GGCTAGCTACAACGA GCGGCCCG 


3786 


609 


GGCCGCGC G CCCUCCUC 


1459 


GAGGAGGG GGCTAGCTACAACGA GCGCGGCC 


3787 


618 


CCCUCCUC G CCCCCAGG 


1460 


CCTGGGGG GGCTAGCTACAACGA GAGGAGGG 


3788 


626 


GCCCCCAG G CGGCAGCA 


1461 


TGCTGCCG GGCTAGCTACAACGA CTGGGGGC 


3789 I 


629 


CCCAGGCG G CAGCAAUA 


1462 


TATTGCTG GGCTAGCTACAACGA CGCCTGGG 


3790 


632 


AGGCGGCA G CAAUACGC 


1463 


GCGTATTG GGCTAGCTACAACGA TGCCGCCT 


3791 


635 


CGGCAGCA A UACGCGCG 


1464 


CGCGCGTA GGCTAGCTACAACGA TGCTGCCG 


3792 


637 


GCAGCAAU A CGCGCGGC 


1465 


GCCGCGCG GGCTAGCTACAACGA ATTGCTGC 


3793 


639 


AGCAAUAC G CGCGGCGC 


1466 


GCGCCGCG GGCTAGCTACAACGA GTATTGCT 


3794 


641 


CAAUACGC G CGGCGCGG 


1467 


CCGCGCCG GGCTAGCTACAACGA GCGTATTG 


3795 


644 


UACGCGCG G CGCGGGCC 


1468 


GGCCCGCG GGCTAGCTACAACGA CGCGCGTA 


3796 


646 


CGCGCGGC G CGGGCCGG 


1469 


CCGGCCCG GGCTAGCTACAACGA GCCGCGCG 


3797 


650 


CGGCGCGG G CCGGGGGC 


1470 


GCCCCCGG GGCTAGCTACAACGA CCGCGCCG 


3798 


657 


GGCCGGGG G CGCGGGGC 


1471 


GCCCCGCG GGCTAGCTACAACGA CCCCGGCC 


3799 


659 


CCGGGGGC G CGGGGCCG 


1472 


CGGCCCCG GGCTAGCTACAACGA GCCCCCGG 


3800 


664 


GGCGCGGG G CCGGCGGG 


1473 


CCCGCCGG GGCTAGCTACAACGA CCCGCGCC 


3801 
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OQO 


CCZCZCCC CO. P CCCCCCllTi 


1474 


TACCCCCCi PPPTB.PPT&P&ZvPPZV CCCCCCCC 


3802 


672 


firmRpno p pptta appp 


1475 


PPPTTAPP PPPTAPPTAPAAPPA CCCCCCCC 


3803 


674 


pppprppp a uaappppp 


1476 
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1613 


GGGAGGCG GGCTAGCTACAACGA CCGAGGGC 


3941 




pp? tr^r^ffc* p ppttppptttt 
LLULbbvjL. G LlUlLLUU 


1614 


AAGGGAGG GGCTAGCTACAACGA GCCCGAGG 


3942 


T *3 n c 
1 J U b 


TTPPPITTTTTA P riot TT TT T/^T TP 

ULLlUUUA g lluuulug 


1615 


CAGAAAGG GGCTAGCTACAACGA TAAAGGGA 


3943 


i on i 
1j1 J 


PPPITTTTTPTT P PPPA PP/^A 

GL.L.UUUGU G GGGAGGGA 


1616 


TGGGTCGG GGCTAGCTACAACGA AGAAAGGC 


3944 


JLO JL / 


TTTTPTTPPPP A PPPAPPAP 

UUlUuLHj a gggaggag 


1617 


LrGCTGGG GGCTAGCTACAACGA CGGCAGAA 


3945 


i no 
1 Jz z 


CCGACCCA G CaGCUUCU 


1618 


AGAAGCTG GGCTAGCTACAACGA TGGGTCGG 


3946 


1325 


ACCCAGCA G CUUCUAAU 


1619 


ATTAGAAG GGCTAGCTACAACGA TGCTGGGT 


3947 


1 ^ 9 


APpTTTTPTTA A T TT TT TP PPT TP 


1620 


L ALL L AAA GGCTAGCTACAACGA TAGAAGCT 


3948 


1338 


UAAUUUGG G UGCGUGGU 


1621 


ACCACGCA GGCTAGCTACAACGA CCAAATTA 


3949 


1340 


auuugggu g cgugguug 


1622 


CAACCACG GGCTAGCTACAACGA ACCCAAAT 


3950 


1342 


UUGGGUGC G UGGUUGAG 


1623 


CTCAACCA GGCTAGCTACAACGA GCACCCAA 


3951 


1345 


GGUGCGUG g UUGAGAGC 


1624 


GCTCTCAA GGCTAGCTACAACGA CACGCACC 


3952 


1352 


GGUUGAGA G CGCUCAGC 


1625 


GCTGAGCG GGCTAGCTACAACGA TCTCAACC 


3953 


1354 


UUGAGAGC G CUCAGCUG 


1626 


CAGCTGAG GGCTAGCTACAACGA GCTCTCAA 


3954 


1359 


AGCGCUCA G CUGUCAGC 


1627 


GCTGACAG GGCTAGCTACAACGA TGAGCGCT 


3955 


1362 


GCUCAGCU G UCAGCCCU 


1628 


AGGGCTGA GGCTAGCTACAACGA AGCTGAGC 


3956 


1366 


AGCUGUCA G CCCUGCCU 


1629 


AGGCAGGG GGCTAGCTACAACGA TGACAGCT 


3957 
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1371 


UCAGCCCU G CCUUUGAG 


1630 


CTCAAAGG GG C TAG CTACAACGA AGGGCTGA 


3958 


1381 


CUUUGAGG G CUGGGUCC 


1631 


GGACCCAG GG CT AGCTACAACG A CCTCAAAG 


3959 


1386 


AGGGCUGG G UCCCUUUU 


1632 


AAAAGGGA GG CTAGCTACAACG A CCAGCCCT 


3960 


13 98 


CUUUUCCC A UCACUGGG 


1633 


CCCAGTGA GG CTAGCT ACAACGA GGGAAAAG 


3961 


1401 


UUCCCAUC A CUGGGUCA 


1634 


TGACCCAG GGCTAGCTACAACGA GATGGGAA 


3962 


1406 


AUCACUGG G UCAUUAAG 


1635 


CTTAATGA GGCTAGCTACAACGA CCAGTGAT 


3963 


1409 


ACUGGGUC A UUAAGAGC 


1636 


GCTCTTAA GGCTAGCTACAACGA GACCCAGT 


3964 


1416 


CAUUAAGA G CAAGUGGG 


1637 


CCCACTTG GGCTAGCTACAACGA TCTTAATG 


3965 


1420 


AAGAGCAA G UGGGGGCG 


1638 


CGCCCCCA GGCTAGCTACAACGA TTGCTCTT 


3966 


1426 


AAGUGGGG G CGAGGCGA 


1639 


TCGCCTCG GGCTAGCTACAACGA CCCCACTT 


3967 


1431 


GGGGCGAG G CGACAGCC 


1640 


GGCTGTCG GGCTAGCTACAACGA CTCGCCCC 


3968 


1434 


GCGAGGCG A CAGCCCUC 


1641 


GAGGGCTG GGCTAGCTACAACGA CGCCTCGC 


3969 


1437 


AGGCGACA G CCCUCCCG 


1642 


CGGGAGGG GGCTAGCTACAACGA TGTCGCCT 


3970 


1445 


GCCCUCCC G CACGCUGG 


1643 


CCAGCGTG GGCTAGCTACAACGA GGGAGGGC 


3971 


1447 


CCUCCCGC A CGCUGGGU 


1644 


ACCCAGCG GGCTAGCTACAACGA GCGGGAGG 


3972 


1449 


UCCCGCAC G CUGGGUUG 


1645 


CAACCCAG GGCTAGCTACAACGA GTGCGGGA 


3973 | 


1454 


CACGCUGG G UUGCAGCU 


1646 


AGCTGCAA GGCTAGCTACAACGA CCAGCGTG 


3974 


1457 


GCUGGGUU G CAGCUGCA 


1647 


TGCAGCTG GGCTAGCTACAACGA AACCCAGC 


3975 


1460 


GGGUUGCA G CUGCACAG 


1648 


CTGTGCAG GGCTAGCTACAACGA TGCAACCC 


3976 


1463 


UUGCAGCU G CACAGGUA 


1649 


TACCTGTG GGCTAGCTACAACGA AGCTGCAA 


3977 


1465 


GCAGCUGC A CAGGUAGG 


1650 


CCTACCTG GGCTAGCTACAACGA GCAGCTGC 


3978 


1469 


CUGCACAG G UAGGCACG 


1651 


CGTGCCTA GGCTAGCTACAACGA CTGTGCAG 


3979 


1473 


ACAGGUAG G CACGCUGG 


1652 


GCAGCGTG GGCTAGCTACAACGA CTACCTGT 


3980 


1475 


AGGUAGGC A CGCUGCAG 


1653 


CTGCAGCG GGCTAGCTACAACGA GCCTACCT 


3981 


1477 


GUAGGCAC G CUGCAGUC 


1654 


GACTGCAG GGCTAGCTACAACGA GTGCCTAC 


3982 


1480 


GGCACGCU G CAGUCCUU 


1655 


AAGGACTG GGCTAGCTACAACGA AGCGTGCC 


3983 


1483 


ACGCUGCA G UCCUUGCU 


1656 


AGCAAGGA GGCTAGCTACAACGA TGCAGCGT 


3984 


1489 


CAGUCCUU G CUGCCUGG 


1657 


CCAGGCAG GGCTAGCTACAACGA AAGGACTG 


3985 


1492 


UCCUUGCU G CCUGGCGU 


1658 


ACGCCAGG GGCTAGCTACAACGA AGCAAGGA 


3986 


1497 


GCUGCCUG G CGUUGGGG 


1659 


CCCCAACG GGCTAGCTACAACGA CAGGCAGC 


3987 


1499 


UGCCUGGC G UUGGGGCC 


1660 


GGCCCCAA GGCTAGCTACAACGA GCCAGGCA 


3988 


1505 


GCGUUGGG G CCCAGGGA 


1661 


TCCCTGGG GGCTAGCTACAACGA CCCAACGC 


3989 


1513 


GCCCAGGG A CCGCUGUG 


1662 


CACAGCGG GGCTAGCTACAACGA CCCTGGGC 


3990 


1516 


CAGGGACC G CUGUGGGU 


1663 


ACCCACAG GGCTAGCTACAACGA GGTCCCTG 


3991 


1519 


GGACCGCU G UGGGUUUG 


1664 


CAAACCCA GGCTAGCTACAACGA AGCGGTCC 


3992 


1523 


CGCUGUGG G UUUGCCCU 


1665 


AGGGCAAA GGCTAGCTACAACGA CCACAGCG 


3993 


1527 


GUGGGUUU G CCCUUCAG 


1666 


CTGAAGGG GGCTAGCTACAACGA AAACCCAC 


3994 


1536 


CCCUUCAG A UGGCCCUG 


1667 


CAGGGCCA GGCTAGCTACAACGA CTGAAGGG 


3995 


1539 


UUCAGAUG G CCCUGCCA 


1668 


TGGCAGGG GGCTAGCTACAACGA CATCTGAA 


3996 


1544 


AUGGCCCU G CCAGCAGC 


1669 


GCTGCTGG GGCTAGCTACAACGA AGGGCCAT 


3997 


1548 


CCCUGCCA G CAGCUGCC 


1670 


GGCAGCTG GGCTAGCTACAACGA TGGCAGGG 


3998 


1551 


UGCCAGCA G CUGCCCUG 


1671 


CAGGGCAG GGCTAGCTACAACGA TGCTGGCA 


3999 


1554 


CAGCAGCU G CCCUGUGG 


1672 


CCACAGGG GGCTAGCTACAACGA AGCTGCTG 


4000 


1559 


GCUGCCCU G UGGGGCCU 


1673 


AGGCCCCA GGCTAGCTACAACGA AGGGCAGC 


4 001 


1564 


CCUGUGGG G CCUGGGGC 


1674 


GCCCCAGG GGCTAGCTACAACGA CCCACAGG 


4002 


1571 


GGCCUGGG G CUGGGCCU 


1675 


AGGCCCAG GGCTAGCTACAACGA CCCAGGCC 


4003 


1576 


GGGGCUGG G CCUGGGCC 


1676 


GGCCCAGG GGCTAGCTACAACGA CCAGCCCC 


4004 


1582 


GGGCCUGG G CCUGGCUG 


1677 


CAGCCAGG GGCTAGCTACAACGA CCAGGCCC 


4005 


1587 


UGGGCCUG G CUGAGCAG 


1678 


CTGCTCAG GGCTAGCTACAACGA CAGGCCCA 


4006 


1592 


CUGGCUGA G CAGGGCCC 


1679 


GGGCCCTG GGCTAGCTACAACGA TCAGCCAG 


4007 


1597 


UGAGCAGG G CCCUCCUU 


1680 


AAGGAGGG GGCTAGCTACAACGA CCTGCTCA 


4008 


1607 


CCUCCUUG G CAGGUGGG 


1681 


CCCACCTG GGCTAGCTACAACGA CAAGGAGG 


4009 
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1611 


LUUGGCAG G UGGGGCAG 


1682 


pi<"npi/-i/-i/"ip» A nnr"T7\r'/nmTiPTi7irtn5\ *"irn/*i /-lyi tv 7VP» 

C I GGC CCA GGC I AGCTACAACGA CTGCCAAG 


4010 


1 bib 


UAGGUGGG G UAl^GAGAG 


1683 


pirn pirn pi p^rppi PPPT t Apr w T'7iPJVRPP7i PipipiTV piP 1 T 1 P' 

Lj 1 L J. GG 1 o GGG J. AGG I AGAAGGA GGCACG 1 G 


4011 


1 bz j 


GGCAGGAlj A LLLUbUAb 


1684 


PTAPTiPPP PPPTJ\ P'PTA r>Ti A PP A P 1 »T'PiP"T»P^P'P 1 

UIAGAvjIjG GGG 1AGG1 AGAAGGA GJ.CCJLGCG 


4012 


1628 


t*iT\ s~it\ /if~>/~iTl P"» fTft Plpi 7\ P*pi A 

GAGACCCU G UAGGAGGA 


1685 


rrt/~</~irrt/~1f~irr\ j\ priPTAPn/TiTlPA R/^rfTt TV /"H-trim/intr* 

1GG1GG1A GGC 1 AGCTACAACGA AGGGTCTC 


4013 


1 coc 


GUAGGAGG A GGGGGGGG 


1686 


pipipipi r^CiCr* PPPTSPPT'TiPASPPJ\ /-i/-irpp^p»rp7\ f-i 

GGGGGGGvj GGG 1 AGG I AGAAGGA CCiCC-lAC 


4014 


lD4 J 


p -1 7\ (~*r*rTT*<r* r» pipipip^ a rrT* 
GAGGGGGG G GGGCAGGG 


1687 


pi /-i pirp/^i /-ipi P 1 PPPTRPPTTiPATlPPn rTTTi/T' r T*t~* 

GGG1GGGG GGG 1 AGG 1 AGAAGGA LGGGGG1L 


4015 


Xo4b 


r~*r*r>r~T , ccr< r* pi a pi pi pip* pr* 
GGGGGGGG G GAGGGGGG 


1688 


(T'rTT'f'vr* pppttippt'tiPiisppa p , p i r^r^r^r^r^r^ 
LitjtaGGGHj GGGXAcrG I AGAAGGA GGGGGGGG 


4016 


i c c n 


pjpiptpipi pi a pi p - * p i p i p^p i ttp < a p 1 
IjGCGGGAG G GGGCUGALr 


1689 


GlUAGGGti GGG 1AGG1 AGAAGGA GiGGGGGG 


4017 


lob ± 


P 1 PT TP 1 A P»P"i A pi P*P* A T TP"< 7\ P"Pi 

UGUGAGGA G GGAUGAGG 


1690 


pip>rp<-« iifppp PPPTTAPPfPTlPAAPPTN TP<pn~pP"i7\ P"ip* 

L-tj 1 GA 1 GG GGG 1 AGGTAGAACGA a GC1 GAGG 


4018 


1664 


PAPPTlPPP A T A P 1 P 1 P 1 A A 

GAGGAGCG A UGACGGAA 


1691 


TTppl^PirpP"' A /"•/"» P"T1 7\ P»/*WT>7\ Pi TV 7\ /~1/~»7\ 0/~< #Tm/*1/~im/^ 

1 1 GCG 1 GA GGG I AGCTACAACGA CGCTCCTC 


4019 


1667 


P 1 7\ P* P 1 P* A T 7ri A PPP7\7\TTnTl 

GAGCGAUG A GGGAAUAU 


1692 


A T»A H»PPPP PiP*ipirpTV PV'wi ITV P»A A /-in TV f* TV rn/-ir» r»T P» 

ATATTCCG GGCT AGCTACAACGA CATCGCTC 


4020 


1672 


A TIP* A PW"» A A TTA TTTV IVPPTf 

AUGACGGA A UAUAAGCU 


1693 


AGCTTATA GGCTAGCTACAACGA TCCGTCAT 


4021 


1674 


pi 7\ P 1 P 1 P 1 A ATT A nB7\PPTTPP 

GAGGGAAU A UAAGCUGG 


1694 


GCAGGT1A GGCTAGCTACAACGA ATTCCGTC 


4022 


lb /o 


P 1 A A T TA T TA A pi fTTPipiTTPipiTT 

GAAUAUAA G GUGGUGGU 


1695 


J\PP7\PP5\P PPPT7\PPT>APTiHPP7i mrpii rnTV mrnr" 

AG GAG GAG GGG 1 AGG 1 AGAAGGA 1 1 ATA 1 1 G 


4023 


1682 


ATTA7\PPTTP Pi T TP*PiT TP 1 P" TIP* 

AUAAGGUG G UGGUGGUG 


1696 


P»Tt Prn\ PiP»7\ /^/^nrrtTV Pi/irMTv PiTV TV /*>mv nivrirmwrnfr 

CACCACCA GGCTAGCTACAACGA CAGCTTAT 


4024 


1 CO c 

1685 


7\ /""•/"IT TP'P" , T TP 1 P 1 T TP? P" 1 T TP 1 P* P" 1 P 1 

AGCUGGUG G UGGUGGGG 


1697 


pipi/~i/~iTV ptp»7v o /-irp 7V n i^»n 7V /-i TV TV TV rtTi rionnnrp 

GCCCACCA GGCTAGCTACAACGA CACCAGCT 


4025 


1688 


UGGUGGUG G UGGGCGCC 


1698 


GGCGCCCA GGCTAGCTACAACGA CACCACCA 


4026 


1692 


GGUGGUGG G CGCCGGCG 


1699 


CGCCGGCG GGCTAGCTACAACGA CCACCACC 


4027 1 


1694 


UGGUGGGC G CCGGCGGU 


1700 


ACCGCCGG GGCTAGCTACAACGA GCCCACCA 


4028 


1698 


GGGCGCCG G CGGUGUGG 


1701 


CCACACCG GGCTAGCTACAACGA CGGCGCCC 


4029 


1701 


CGCCGGCG G UGUGGGCA 


1702 


TGCCCACA GGCTAGCTACAACGA CGCCGGCG 


4030 


1703 


CCGGCGGU G UGGGCAAG 


1703 


CTTGCCCA GGCTAGCTACAACGA ACCGCCGG 


4031 


1707 


On nt Tint Tn/^ r»Tki\nivrtTT/i 

CGGUGUGG G CAAGAGUG 


1704 


CACTCTTG GGCTAGCTACAACGA CCACACCG 


4032 


1713 


GGGCAAGA G UGCGCUGA 


1705 


TCAGCGCA GGCTAGCTACAACGA TCTTGCCC 


4033 


1715 


GCAAGAGU G CGCUGACC 


1706 


GGTCAGCG GGCTAGCTACAACGA ACTCTTGC 


4034 


1717 


AAGAGUGC G CUGACCAU 


1707 


ATGGTCAG GGCTAGCTACAACGA GCACTCTT 


4035 


1721 


GUGCGCUG A CCAUCCAG 


1708 


CTGGATGG GGCTAGCTACAACGA CAGCGCAC 


4036 


1724 


CGCUGACC A UCCAGCUG 


1709 


CAGCTGGA GGCTAGCTACAACGA GGTCAGCG 


4037 


1729 


ACCAUCCA G CUGAUCCA 


1710 


TGGATCAG GGCTAGCTACAACGA TGGATGGT 


4038 


1733 


UCCAGCUG A UCCAGAAC 


1711 


GTTCTGGA GGCTAGCTACAACGA CAGCTGGA 


4039 


1740 


GAUCCAGA A CCAUUUUG 


1712 


CAAAATGG GGCTAGCTACAACGA TCTGGATC 


4040 


1743 


CCAGAACC A UUUUGUGG 


1713 


CCACAAAA GGCTAGCTACAACGA GGTTCTGG 


4041 


1748 


ACCAUUUU G UGGACGAA 


1714 


TTCGTCCA GGCTAGCTACAACGA AAAATGGT 


4042 


1752 


UUUUGUGG A CGAAUACG 


1715 


CGTATTCG GGCTAGCTACAACGA CCACAAAA 


4043 


1756 


GUGGACGA A UACGACCC 


1716 


GGGTCGTA GGCTAGCTACAACGA TCGTCCAC 


4044 


1758 


PI /-I * /"Ipl TV 7V T T TV /l/ITV /-1/""1/"1/"1TV 

GGACGAAU A CGACCCCA 


1717 


TGGGGTCG GGCTAGCTACAACGA ATTCGTCC 


4045 


1761 


CGAAUACG A CCCCACUA 


1718 


TAGTGGGG GGCTAGCTACAACGA CGTATTCG 


4046 


1766 


ACGACCCC A CUAUAGAG 


1719 


CTCTATAG GGCTAGCTACAACGA GGGGTCGT 


4047 


1769 


ACCCCACU A UAGAGGAU 


1720 


ATCCTCTA GGCTAGCTACAACGA hGTGGGGT 


4048 


1776 


UAUAGAGG A UUCCUACC 


1721 


GGTAGGAA GGCTAGCTACAACGA CCTCTATA 


4049 


1782 


GGAUUCCU A CCGGAAGC 


1722 


GCTTCCGG GGCTAGCTACAACGA AGGAATCC 


4050 


1789 


UACCGGAA G CAGGUGGU 


1723 


ACCACCTG GGCTAGCTACAACGA TTCCGGTA 


4051 


1793 


GGAAGCAG G UGGUCAUU 


1724 


AATGACCA GGCTAGCTACAACGA CTGCTTCC 


4052 


1796 


AGCAGGUG G UCAUUGAU 


1725 


ATCAATGA GGCTAGCTACAACGA CACCTGCT 


4053 


1799 


AGGUGGUC A UUGAUGGG 


1726 


CCCATCAA GGCTAGCTACAACGA GACCACCT 


4054 


1803 


GGUCAUUG A UGGGGAGA 


1727 


TCTCCCCA GGCTAGCTACAACGA CAATGACC 


4055 


1811 


AUGGGGAG A CGUGCCUG 


1728 


CAGGCACG GGCTAGCTACAACGA CTCCCCAT j 


4056 


1813 


GGGGAGAC G UGCCUGUU 


1729 


AACAGGCA GGCTAGCTACAACGA GTCTCCCC 


4057 


1815 


GGAGACGU G CCUGUUGG 


1730 


CCAACAGG GGCTAGCTACAACGA ACGTCTCC 


4058 


1819 


ACGUGCCU G UUGGACAU 


1731 


ATGTCCAA GGCTAGCTACAACGA AGGCACGT 


4059 


1824 


CCUGUUGG A CAUCCUGG 


1732 


CCAGGATG GGCTAGCTACAACGA CCAACAGG 


4060 


1826 


UGUUGGAC A UCCUGGAU 


1733 


ATCCAGGA GGCTAGCTACAACGA GTCCAACA 


4061 
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1833 


paitppttpp a tta pppppp 


1734 


ppppppta P,nPTAPPTAPaapp.a ppapp, ato 


4062 




ttppttppatt a rrrzrcacic 


1735 


pppppfpp ppptap.pt APaapn a aTPPappA 


4063 


1 ft *} ft 


Trppanapp p r , p^3TT , a^ 


1736 


ptpppppp ppPTAPPTAPaappa pPTaTPPA 


4064 


1 fid'? 


ttapppppp p ppappapp 


1737 


cc^oo^nco pppTappTAPaappa ppppppTa 

V_L- X L.L XVjVj VjkaL.±A\j^lAl—AA^oA LuUvUVJ Jl jHl 


4065 




L^AOOAOOA O UAV^AOl—OL- 


1738 


(^pnrTP.T'a ppPTappTanaappa tpptpptp 


4066 




ppappapn a ranrnrrzi 


1739 


tppppptp PPPTAPPTapa appa aPTPPTPP 


4067 


1857 


ppapttapa p ppppattpp 

OO.rtOUAV»»A O O^V — f"iUO^. 


1740 


ocTxTCCiccz ppPTAPPTapaappa TPTapTPP 


4068 


1 ft *"i 9 


APnapapp p rraTTnrnn 


1741 


CC*Cr'D< r V(^C r PPPTaPPTAPaAPPA PPTPTaPT 


4069 


1 ft 


anzvfinnpp a ttpppppap 

A^AoUok-V- A UoL-OOO-H.^- 


1742 


n^cccccTi pppTappTapaappa c m nr , cc r rn, r v 

L> 1 L.\^L.LrL~A VjIjL- IJiuL 1 ALAALAjA OOLLjL. 1 Li 1 


4070 


■J-Oul 


apppppaiT p oooc-ncon 

AoV-oLLAU o L-oooALLA 


1743 


Tccvccoa pppts pptb p a a 00 a a *vno cc pt 

IL10XL.LL.L1 L7LiU1ALiL.XAL«AALv3A A1L»VjL-VjL1 


4071 


lOCQ 

JLoOJ 


PATTPnppp a ppAPTTapa 

LAUoLooO A L»LAoUALA 


1744 


TPTRPTPP OO CV A C OT 1 A P* 7\ 7S. CO A CCCC C A TP 
X vj J. AL. X VjLi oLiL.1ALiL.1ALAAL.oA LL.L0LAI0 


4072 


lO / J 


COCCI. A PP A P TT&PArTPPP 
LoooALLA o UALAUoLo 


1745 


pppaTPTi pppTapPTAPA apva T , r , PT»r , r i r , r' 
k_oLAlolA 00L.IA0LIALAAL0A looloLLo 


4073 


1 ft7R 


ppappapn a panppppa 


1746 


tppppatp pppTapprapaappa aPTPPTPP 

XoLOv-AlO 00L.IAOL.IAUAAL0A AL-loolLL. 


4074 


10 / / 


appapuap a uppppapp 

ZiLLwUnL A UoLoLALL 


1747 


ooTCCCCh. pppTanPTapaappa cn±o w voo t v 

00 1 0L0LA 00L. 1 AoL 1 ALAALoA 0 1 AL. 1 00 1 


4075 


io / y 


papTtapaTT p ppprpppp 
UALjUAL.AU b bbLAbbbb 


1748 


OOOOT'COO COO r V7\CO r y7\ P 1 A A PP A 7\TPT7l PTP 

LLoo 1 0L0 L70L. 1 AoL. 1 ALAAL-oA A 1 0 1 AL 1 0 


4076 


1 OQ1 


pnaPATTPP p cts.cccccc 
0UALAU0U b LALLoobb 


1749 


OOOOOOrpO CCCVRCCVTiCTkTtCCTi PP71TPTRP 

LLLL-ooxo ooL.lAL»L.lAL>AALoA 0UAI0IAL 


4077 


lobJ 


apaitpppp a cccccct\c 
ALAUbLbL A L-LbbbbAb 


1750 


OT COO COO O C PT 1 A O Orp 7\ P> 7\ A CO T\ PPPP1\TPT 

LlL-LLLGo bbL.lAGL.lALAACoA GLGCAIGT 


4078 


18 93 


CCCCCT\ CC C PTTTTPPTTPT7 

LbbbbAbb b L-UULLUbU 


1751 


ACAGGAAG GGCTAGCxACAACGA CCTCCCCG 


4079 


T Q Pi Pt 

xyuu 


pppttttpptt c ttpttpttptttt 
LjoL.UUL.LU b UbUbUbUU 


1752 


A AP 1 AP'AP 1 A PPPTAPPTAPA 7\ pp 7\ Annnsppp 

AAL ALALIA 0GL.IAGLIALAALGA ALjoAAoLL. 


4080 


i q n o 


pttttppttptt c i tpt tpt tt tt tp 
LUULLUoU 0 U0U0UUU0 


1753 


PAAAPAPA PPPTfln/ "I'APTi A CC A A /""*A PP A A /"» 

LAAALALA ooLIALjLI ALAALoA ALAooAAo 


4081 


1 QPl/1 

x, yui 


T TP PT TP.T TPT T P TTPTTTTTTPPP 
UH~UoUoU 0 UljUUUuLL 


1754 


PPPA A A PA PPPTaPPTAPA APPA APAPAPPA 

00LAAALIA bbL 1 AoL. 1 ALAALoA ALAL-ALjLiA 


4082 


i an <c 


PTTPrTPTTPTT C 7 TT TT JO PP ATT 

LUbUbUbU Li UUUbbLAU 


1755 


AHPPPPAAA PPPfA PPTTl PA A PP A APAPAPAP 

AlooLAAA ooL-lAGLl ALAALoA ALALAUAo 


4083 




PTTPTTPTTTTTT C PP A TTPA A C 
bUbUbUUU b CL.AU CAAL 


1756 


C r V r T*OT\ r POO PPPTAPPTAPAAPPA AAAPAPAP 

olloAIoo GGuIAGlIALAALXjA AAACACAC 


4084 




T7P1TTTTTPPP A TTPA A C7\ A C 

UbUUUbLL. A ULAALAAL 


1757 


r^ r T l 'TO r T"T , C7\ PPPTAPPTAPAAPPA PPPAAAPA 
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OA A O 


P'TTP'P'TTP'P' A P* PPPAPTTPA 

GUGGUGGA G GLGAGUCA 


1874 


rr>r>7\ rtmrinri ripnmnririrnTinii unriTi mnmnrtun 

TGACTGGG GGCTAGCTACAACGA TCCAGCAG 


4202 


zftb J 


r*HT\HHnf*?i t~* TIPAPPP P'P* 

GGAGGGGA G UGAGLGGG 


1875 


f~*/-i/~i PP'Tir^ 7i PPPPmv P»P*n7\ P T\ Tk PP T\ mnnnornriri 

GGGGGIGA GGG1 AGG 1 AGAAGGA TGGGCTCC 


4203 


z45o 


PPPPAPTTP T\ 0/^/^/~'/-»P'P» A 

GLCCAGuG A CCCCGGGA 


1876 


TCCCGGGG GGCTAGCTACAACGA GACTGGGC 


4204 


9 A £ A 

z4 b4 


AGGGUGGG A GGGUGGGG 


1877 


GCCCACGG GGCTAGCTACAACGA CCCGGGGT 


4205 


z4b / 


/-^/-i/~i/->/-it\ /~\r-* /-1 T TP* PV-« P'P'P' A 

GGGGGACC G UGGGCXGA 


1878 


TCGGCCCA GGCTAGCTACAACGA GGTCCCGG 


4206 


2471 


GACCGUGG G CCGAGGUG 


1879 


CACCTCGG GGCTAGCTACAACGA CCACGGTC 


4207 


9 A T T 
Z** / / 


f* , rT~ , {~T'{~'Tif~' P" T TP 1 A P"T TP 1 P 1 A 

GGGGGGAG G UGAGUGGA 


1880 


I GLAGTCA GGCTAGCTACAACGA CTCGGCCC 


4208 


2480 


CCGAGGUG A CUGCAGAC 


1881 


GTCTGCAG GGCTAGCTACAACGA CACCTCGG 


4209 


2483 


AGGUGACU G CAGACCCU 


1882 


AGGGTCTG GGCTAGCTACAACGA AGTCACCT 


4210 


2487 


GACUGCAG A CCCUCCCA 


1883 


TGGGAGGG GGCTAGCTACAACGA CTGCAGTC 


4211 


2501 


CCAGGGAG G CUGUGCAC 


1884 


GTGCACAG GGCTAGCTACAACGA CTCCCTGG 


4212 


2504 


GGGAGGCU G UGCACAGA 


1885 


TCTGTGCA GGCTAGCTACAACGA AGCCTCCC 


4213 


2506 


GAGGCUGU G CACAGACU 


1886 


AGTCTGTG GGCTAGCTACAACGA ACAGCCTC 


4214 


2508 


GGCUGUGC A CAGACUGU 


1887 


ACAGTCTG GGCTAGCTACAACGA GCACAGCC 


4215 


2512 


GUGCACAG A CUGUCUUG 


1888 


CAAGACAG GGCTAGCTACAACGA CTGTGCAC 


4216 


2515 


CACAGACU G UCUUGAAC 


1889 


GTTCAAGA GGCTAGCTACAACGA AGTCTGTG 


4217 
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2522 


UGUCUUGA A CAUCCCAA 


1890 


TTGGGATG GGCTAGCTACAACGA TCAAGACA 


4218 


2524 


UCUUGAAC A UCCCAAAU 


1891 


ATTTGGGA GGCTAGCTACAACGA GTTCAAGA 


4219 


2531 


CAUCCCAA A UGCCACCG 


1892 


CGGTGGCA GGCTAGCTACAACGA TTGGGATG 


4220 


2533 


UCCCAAAU G CCACCGGA 


1893 


TCCGGTGG GGCTAGCTACAACGA ATTTGGGA 


4221 


2536 


CAAAUGCC A CCGGAACC 


1894 


GGTTCCGG GGCTAGCTACAACGA GGCATTTG 


4222 


2542 


CCACCGGA A CCCCAGCC 


1895 


GGCTGGGG GGCTAGCTACAACGA TCCGGTGG 


4223 


2548 


GAACCCCA G CCCUUAGC 


1896 


GCTAAGGG GGCTAGCTACAACGA TGGGGTTC 


4224 


2555 


AGCCCUUA G CUCCCCUC 


1897 


GAGGGGAG GGCTAGCTACAACGA TAAGGGCT 


4225 


2568 


CCUCCCAG G CCUCUGUG 


1898 


CACAGAGG GGCTAGCTACAACGA CTGGGAGG 


4226 


2574 


AGGCCUCU G UGGGCCCU 


1899 


AGGGCCCA GGCTAGCTACAACGA AGAGGCCT 


4227 


2578 


CUCUGUGG G CCCUUGUC 


1900 


GACAAGGG GGCTAGCTACAACGA CCACAGAG 


4228 


2584 


GGGCCCUU G UCGGGCAC 


1901 


GTGCCCGA GGCTAGCTACAACGA AAGGGCCC 


4229 


2589 


CUUGUCGG G CACAGAUG 


1902 


CATCTGTG GGCTAGCTACAACGA CCGACAAG 


4230 


2591 


UGUCGGGC A CAGAUGGG 


1903 


CCCATCTG GGCTAGCTACAACGA GCCCGACA 


4231 


2595 


GGGCACAG A UGGGAUCA 


1904 


TGATCCCA GGCTAGCTACAACGA CTGTGCCC 


4232 


2600 


CAGAUGGG A UCACAGUA 


1905 


TACTGTGA GGCTAGCTACAACGA CCCATCTG 


4233 


2603 


AUGGGAUC A CAGUAAAU 


1906 


ATTTACTG GGCTAGCTACAACGA GATCCCAT 


4234 


2606 


GGAUCACA G UAAAUUAU 


1907 


ATAATTTA GGCTAGCTACAACGA TGTGATCC 


4235 


2610 


CACAGUAA A UUAUUGGA 


1908 


TCCAATAA GGCTAGCTACAACGA TTACTGTG 


4236 


2613 


AGUAAAUU A UUGGAUGG 


1909 


CCATCCAA GGCTAGCTACAACGA AATTTACT 


4237 


2618 


AUUAUUGG A UGGUCUUG 


1910 


CAAGACCA GGCTAGCTACAACGA CCAATAAT 


423B 


2621 


AUUGGAUG G UCUUGAUC 


1911 


GATCAAGA GGCTAGCTACAACGA CATCCAAT 


4239 


2627 


UGGUCUUG A UCUUGGUU 


1912 


AACCAAGA GGCTAGCTACAACGA CAAGACCA 


4240 


2633 


UGAUCUUG G UUUUCGGC 


1913 


GCCGAAAA GGCTAGCTACAACGA CAAGATCA 


4241 


2640 


GGUUUUCG G CUGAGGGU 


1914 


ACCCTCAG GGCTAGCTACAACGA CGAAAACC 


4242 


2647 


GGCUGAGG G UGGGACAC 


1915 


GTGTCCCA GGCTAGCTACAACGA CCTCAGCC 


4243 


2652 


AGGGVGGG A CACGGUGC 


1916 


GCACCGTG GGCTAGCTACAACGA CCCACCCT 


4244 


2654 


GGUGGGAC A CGGUGCGC 


1917 


GCGCACCG GGCTAGCTACAACGA GTCCCACC 


4245 


2657 


GGGACACG G UGCGCGUG 


1918 


CACGCGCA GGCTAGCTACAACGA CGTGTCCC 


4246 


2659 


GACACGGU G CGCGUGUG 


1919 


CACACGCG GGCTAGCTACAACGA ACCGTGTC 


4247 


2661 


CACGGUGC G CGUGUGGC 


1920 


GCCACACG GGCTAGCTACAACGA GCACCGTG 


4248 


2663 


CGGUGCGC G UGUGGCCU 


1921 


AGGCCACA GGCTAGCTACAACGA GCGCACCG 


4249 


2665 


GUGCGCGU G UGGCCUGG 


1922 


CCAGGCCA GGCTAGCTACAACGA ACGCGCAC 


4250 


2668 


CGCGUGUG G CCUGGCAU 


1923 


ATGCCAGG GGCTAGCTACAACGA CACACGCG 


4251 


2673 


GUGGCCUG G CAUGAGGU 


1924 


ACCTCATG GGCTAGCTACAACGA CAGGCCAC 


4252 


2675 


GGCCUGGC A UGAGGUAU 


1925 


ATACCTCA GGCTAGCTACAACGA GCCAGGCC 


4253 


2680 


GGCAUGAG G UAUGUCGG 


1926 


CCGACATA GGCTAGCTACAACGA CTCATGCC 


4254 


2682 


CAUGAGGU A UGUCGGAA 


1927 


TTCCGACA GGCTAGCTACAACGA ACCTCATG 


4255 


2684 


UGAGGUAU G UCGGAACC 


1928 


GGTTCCGA GGCTAGCTACAACGA ATACCTCA 


4256 


2690 


AUGUCGGA A CCUCAGGC 


1929 


GCCTGAGG GGCTAGCTACAACGA TCCGACAT 


4257 


2697 


AACCUCAG G CCUGUCCA 


1930 


TGGACAGG GGCTAGCTACAACGA CTGAGGTT 


4258 


2701 


UCAGGCCU G UCCAGCCC 


1931 


GGGCTGGA GGCTAGCTACAACGA AGGCCTGA 


4259 


2706 


CCUGUCCA G CCCUGGGC 


1932 


GCCCAGGG GGCTAGCTACAACGA TGGACAGG 


4260 


2713 


AGCCCUGG G CUCUCCAU 


1933 


ATGGAGAG GGCTAGCTACAACGA CCAGGGCT 


4261 


2720 


GGCUCUCC A UAGCCUUU 


1934 


AAAGGCTA GGCTAGCTACAACGA GGAGAGCC 


4262 


2723 


UCUCCAUA G CCUUUGGG 


1935 


CCCAAAGG GGCTAGCTACAACGA TATGGAGA 


4263 


2740 


AGGGGGAG G UUGGGAGA 


1936 


TCTCCCAA GGCTAGCTACAACGA CTCCCCCT 


4264 


2750 


UGGGAGAG G CCGGVCAG 


1937 


CTGACCGG GGCTAGCTACAACGA CTCTCCCA 


4265 


2754 


AGAGGCCG G UCAGGGGU 


1938 


ACCCCTGA GGCTAGCTACAACGA CGGCCTCT 


4266 


2761 


GGUCAGGG G UCUGGGCU 


1939 


AGCCCAGA GGCTAGCTACAACGA CCCTGACC 


4267 


2767 


GGGUCUGG G CUGUGGUG 


1940 


CACCACAG GGCTAGCTACAACGA CCAGACCC 


4268 


2770 


UCUGGGCU G UGGUGCUC 


1941 


GAGCACCA GGCTAGCTACAACGA AGCCCAGA 


4269 
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z / /o 


ppppttpttp r* ttppttpttptt 
GGGGUVjUij G UGCUCUCU 


1942 


AGAGAGCA GGC1AGCTACAACGA CACAGCCC 


4270 


2775 


ppttpttpptt 0 pttpttpttpp 

bLUbUuuU Va UUUUUUV-.L. 


1943 


PPAPaPAP PPPTAPPTA PA A PP A 7\ PnAPTVnP 

Vj^AVjAVjAG obL J. AGL1 AUAAUGA ALCAUAGU 


4271 


OTP ft 
z / a o 


Pttppttppp 0 r i nTT/ -, r»r , r , r' 


1944 


GGGGUAGG GGL I AGG I ALAAGGA GGGAGGAG 


4272 


z / yz 


ttpppppptt 0 ppppapttp 

UULLVjjUUU \j UGGGAVjUtjr 


1945 


LAG1GGGG GGG1AGL1AGAAGGA AGGGGGGA 


4273 


Z / J O 


rTTnpprPA r* ttpttppapp 

V_UVsLUV-V-./\ Va UbULLHLb 


1946 


PPTPPAPA PPPTAPPTTi pati ppii TPPPPPIvn 

GG 1 GGAGA GG G 1 AG G I AGAAGG A I GGGG LAG 


4274 


o ft nr» 

Z O U U 


pppppaptt r> iTPPBrrfiP 


1947 


GGGG1GGA GGCTAGCTACAACGA ACTGGGGC 


4275 


o ft n/i 


PAPTTPTTPP A PPPPTTTTPTT 
V_AVjUVjUV_U A V_AjVjL-UUV_U 


1948 


A f A A f* p»p* PPPT7\PPiTiT\P7nvppTv ormnTvnmo 
AGAAGG GG GGGJL AGG1AGAAGGA GGALACTG 


4276 


z o u / 


TTPTTPPA PP (~1 PTTTTPTTPPP 


1949 


L»V_VJAVjAA\j» GGG1AGGIAGAAGGA CGIGGACA 


4277 


Z O J. 1 


PPPTTITPTTP P PaPTiPAPP 
oVjjV— UULUVa Vj UAvjAoAVjL 


1950 


PPTPTPT'P PPPTUPPTA PA A PP A PAPAAPnn 

GU1L1L.IG GGL.1AGG1AGAAGGA CAGAAGCC 


4278 


Z D Z ± 


pppnpjvcn p pttpttppa p 
VjGLAGAGA G GUCUGGAC 


1951 


GTGGAGAG GGCTAGCTACAACGA TCTCTGCC 


4279 


z o zo 


appttpttpp a pa Bprapp 

AoUUGUGG A UAAVjL.AV.jIj 


1952 


PPTP P" I'M IP PPPrPAPPrnv Pi\ 7\ Pn* nPnPRPom 

GG1GLTTG GGCTAGCTACAACGA CCAGAGCT 


4280 


*") D 1 O 
Z D JZ 


pttppapaa p PAPpr>jvnj\ 
GUGGAUAA G CAGGCAGA 


1953 


TCTGCCTG GGCTAGCTACAACGA TTGTCCAG 


4281 


ZOOD 


BPAAPPAP P PAPATTPATT 

ALAA0LA0 Vj UAGAUGAU 


1954 


A TV A IPPtrripi PPprpnrt/ , imiirij\i\o/-i» nmn /~imm/-tm 

A 1 GA 1 C TG GG C 1 AGCTACAACGA CTG CTTGT 


4282 


o ft a. n 


PPSPPPftP A TTPATTAAPP 

GCAGGGAG A UUAUAAGG 


1955 


CCTTATGA GGCTAGCTACAACGA CTG C CTG C 


4283 


O ft A "1 
Z Oft J 


PPPAPATTP A TTA APPJP71 

VjoUAVaAUU A UAAGGAUA 


1956 


TGTGGTTA GGCTAGCTACAACGA GATCTGCC 


4284 


z o*±y 


TTPATTAAPP A PAPAPAPP 

UGAUAAGG A CAGAGAGL 


1957 


GCTCTCTG GGCTAGCTACAACGA CCTTATGA 


4285 


"9 ft ^fi 

ZO JO 


pa pap bp a p pttttapttptt 
VjAUAVjAVjA G GUUALUGU 


1958 


A PAPTA AP< PPPTAnnmii pa a p/~ia m/~impmpm/^ 

AGAGiAAG GGCTAGCTACAACGA TCTCTGTC 


4286 


i ft n 


P A P A P PT TT T A PTTPTTPPTTTT 
uAVjAuLUU A UUVjrUVjUUU 


1959 


T\ A /i /"*7\ /i TV /i/*i/**rpT\ /~1 y^m Tv. /*1 TV Ti /l/^ tv TV TV ^omnmn 

AAGLACAG GG CI AGCTACAACGA AAGCTCTC 


4287 


n q/*i 
Z D DO 


A PPTTTTA PIT P T TP PT TT TPT Y7\ 

AGCUUACU G UGCUUCUA 


1960 


TAGAAGCA GGCTAGCTACAACGA AGTAAGCT 


4288 


Z O DO 


PTTTTAPTTPTT P PT TT TPT TApri 

LUUALUGU G CUULUACC 


1961 


GGTAGAAG GGCTAGCTACAACGA ACAGTAAG 


4289 


ZD / l 


Attp PT TT TPT T A PP A APITAP 

GUGCUULU A CCAACUAG 


1962 


CTAGTTGG GGCTAGCTACAACGA AGAAGCAC 


4290 


ZD / D 


TTTipnappa a pttappipp 

UUGUAGUA A GUAGGAGG 


1963 


GGIGGTAG GGCTAGCTACAACGA TGGTAGAA 


4291 


O ft ftd. 


PTTAPPRPP P PPT TPPT TPP 
GUAGGAGG Vj GGULGUGG 


1964 


/~i/~r Tv 7\ PP Pm A PPm A P A A pnn PPmPPmurt 

GGAGGACG GGCTAGCTACAACGA CCTCCTAG 


4292 


ZOOD 


aPPAPPPP P T TPPT TPPT TP 
AVjvjAVjIjoL G ULGUGGUC 


1965 


/^i 7V /i/^ 1 T\ /i TV /i ry /irp tv /~i / in <tv /"i t\ tv tv /i/i/i/^m/i^irT» 

GAGGAGGA GGCTAGCTACAACGA GCCCTCCT 


4293 


O ft Q9 

z 0 y z 


P PPT TPPT TP P TTPPT TPPBP 


1966 


nrnnp tv r**t~** TV /™*ri r^rpTV ri rmi tv /"^ tv tv /*iy^t tv /i tv /i/i t\ /ip/i 

CTGGAGGA GGCTAGCTACAACGA CAGGACGC 


4294 


Z y U / 


APAPPPAP P TTPPTTTTTTPA 
AVjAVjVjVjAVjr L7 UVjAj U U U LA 


1967 


*n^T TV TV TV /I^TTV /^T/^l/ltTlTV /I TV TV TV /1^ TV /ImPI A/*im/'ui i 

TGAAACCA GGCTAGCTACAACGA CTCCCTCT 


4295 


Z71V 


PPPaPPTTP P TTTTT TP A PPP 
ooVjALrVa U Vj Lr UUU L ALiL»L» 


1968 


/l/^/^rrr/i tv tv TV /i/i ^irp 7V /i /irn TV /i TV TV /i/i tv /in nnrnnrtn 

C GUI G AAA GG CI AGCTACAACGA CACCTCCC 


4296 


oqi q 

Z31? 


TTTTTTPaPPP P T TT TP PPP A T T 


1969 


ATIPPPPA A PPP1P A PPtT»T\ P A T\ P/~t T\ /-iPPrriP A » n 

AIGGGCAA GGC TAGCTACAACGA CCCTGAAA 


4297 


Z 7Z 0 


PPTTT7PPPP a TTPTTPT TP PP 


1970 


PPPAPAPA PPPrn A PP»T» A P A A Po TV nn/noTi Tiz-in 

GGGAGAGA GGCTAGCTACAACGA CCCCAACC 


4298 


Tom 
z yju 


PPPPATTPTT P 7 TPPrTT 1 ! TP 
GGGGAUGU G UGGCGGUG 


1971 


CACCGGCA GGCTAGCTACAACGA AGATCCCC 


4299 


z y Jz 


ppniTPiir"TT p rr^r^r^z tpp p 
GGAUCuGU G CCGGUGGC 


1972 


GCCACCGG GGCTAGCTACAACGA ACAGATCC 


4300 


2 93 6 


CUGUGCCG G UGGCUCUG 


1973 


CAGAGCCA GGCTAGCTACAACGA CGGCACAG 


4301 


2939 


UGCCGGUG G CUCUGGUC 


1974 


GACCAGAG GGCTAGCTACAACGA CACCGGCA 


4302 


2945 


UGGCUCUG G UCUCUGCU 


1975 


AGCAGAGA GGCTAGCTACAACGA CAGAGCCA 


4303 


2951 


UGGUCULU G CUGGGAGC 


1976 


GCTCCCAG GGCTAGCTACAACGA AGAGACCA 


4304 


z yob 


UGCUGGGA G CCUUCUUG 


1977 


CAAGAAGG GGCTAGCTACAACGA TCCCAGCA 


4305 


z yo / 


LCUUCJUUG G CGGUGAGA 


1978 


TCTCACCG GGCTAGCTACAACGA CAAGAAGG 


4306 


2970 


UCUUGGCG G UGAGAGGC 


1979 


GCCTCTCA GGCTAGCTACAACGA CGCCAAGA 


4307 


2977 


GGUGAGAG G CAUCACCU 


1980 


AGGTGATG GGCTAGCTACAACGA CTCTCACC 


4308 


2979 


UGAGAGGC A UCACCUUU 


1981 


AAAGGTGA GGCTAGCTACAACGA GCCTCTCA 


4309 


2982 


GAGGCAUC A CCUUUCCU 


1982 


AGGAAAGG GGCTAGCTACAACGA GATGCCTC 


4310 


2992 


CUUUCCUG A CUUGCUCC 


1983 


GGAGCAAG GGCTAGCTACAACGA CAGGAAAG 


4311 


Z O 


PPTTPaPTTTT P miHCHT^r'n 


1984 


p p T'P Pp7\p PP PT> 7\ P /"irn A PA A PP H ATVPrriPH/^/~i 

GGIGGGAG GGC I AGCTACAACGA AAGTCAGG 


4312 


3003 


UGCUCCCA G CGUGAAAU 


1985 


ATTTCACG GGCTAGCTACAACGA TGGGAGCA 


4313 


3005 


CUCCCAGC G UGAAAUGC 


1986 


GCATTTCA GGCTAGCTACAACGA GCTGGGAG 


4314 


3010 


AGCGUGAA A UGCACCUG 


1987 


CAGGTGCA GGCTAGCTACAACGA TTCACGCT 


4315 


3012 


CGUGAAAU G CACCUGCC 


1988 


GGCAGGTG GGCTAGCTACAACGA ATTTCACG 


4316 


3014 


UGAAAUGC A CCUGCCAA 


1989 


TTGGCAGG GGCTAGCTACAACGA GCATTTCA 


4317 


3018 


AUGCACCU G CCAAGAAU 


1990 


ATTCTTGG GGCTAGCTACAACGA AGGTGCAT 


4318 


3025 


UGCCAAGA A UGGCAGAC 


1991 


GTCTGCCA GGCTAGCTACAACGA TCTTGGCA 


4319 


3028 


CAAGAAUG G CAGACAUA 


1992 


TATGTCTG GGCTAGCTACAACGA CATTCTTG 


4320 


3032 


AAUGGCAG A CAUAGGGA 


1993 


TCCCTATG GGCTAGCTACAACGA CTG CC ATT 


4321 
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3034 


UGGCAGAC A UAGGGACC 


1994 


GGTCCCTA GGCTAGCTACAACGA GTCTGCCA 


4322 


3040 


ACAUAGGG A CCCCGCCU 


1995 


AGGCGGGG GGCTAGCTACAACGA CCCTATGT 


4323 


3045 


GGGACCCC G CCUCCUGG 


1996 


CCAGGAGG GGCTAGCTACAACGA GGGGTCCC 


4324 


3054 


CCUCCUGG G CCUUCACA 


1997 


TGTGAAGG GGCTAGCTACAACGA CCAGGAGG 


4325 


3060 


GGGCCUUC A CAUGCCCA 


1998 


TGGGCATG GGCTAGCTACAACGA GAAGGCCC 


4326 


3062 


GCCUUCAC A UGCCCAGU 


1999 


ACTGGGCA GGCTAGCTACAACGA GTGAAGGC 


4327 


3064 


CUUCACAU G CCCAGUUU 


2000 


AAACTGGG GGCTAGCTACAACGA ATGTGAAG 


4328 


3069 


CAUGCCCA G UUUUCUUC 


2001 


GAAGAAAA GGCTAGCTACAACGA TGGGCATG 


4329 


3079 


UUUCUUCG G CUCUGUGG 


2002 


CCACAGAG GGCTAGCTACAACGA CGAAGAAA 


4330 


3084 


UCGGCUCU G UGGCCUGA 


2003 


TCAGGCCA GGCTAGCTACAACGA AGAGCCGA 


4331 


3087 


GCUCUGUG G CCUGAAGC 


2004 


GCTTCAGG GGCTAGCTACAACGA CACAGAGC 


4332 


3094 


GGCCUGAA G CGGUCUGU 


2005 


ACAGACCG GGCTAGCTACAACGA TTCAGGCC 


4333 


3097 


CUGAAGCG G UCUGUGGA 


2006 


TCCACAGA GGCTAGCTACAACGA CGCTTCAG 


4334 


3101 


AGCGGUCU G UGGACCUU 


2007 


AAGGTCCA GGCTAGCTACAACGA AGACCGCT 


4335 


3105 


GUCUGUGG A CCUUGGAA 


2008 


TTCCAAGG GGCTAGCTACAACGA CCACAGAC 


4336 


3114 


CCUUGGAA G UAGGGCUC 


2009 


GAGCCCTA GGCTAGCTACAACGA TTCCAAGG 


4337 


3119 


GAAGUAGG G CUCCAGCA 


2010 


TGCTGGAG GGCTAGCTACAACGA CCTACTTC 


4338 


3125 


GGGCUCCA G CACCGACU 


2011 


AGTCGGTG GGCTAGCTACAACGA TGGAGCCC 


4339 


3127 


GCUCCAGC A CCGACUGG 


2012 


CCAGTCGG GGCTAGCTACAACGA GCTGGAGC 


4340 


3131 


CAGCACCG A CUGGCCUC 


2013 


GAGGCCAG GGCTAGCTACAACGA CGGTGCTG 


4341 


3135 


ACCGACUG G CCUCAGGC 


2014 


GCCTGAGG GGCTAGCTACAACGA CAGTCGGT 


4342 


3142 


GGCCUCAG G CCUCUGCC 


2015 


GGCAGAGG GGCTAGCTACAACGA CTGAGGCC 


4343 


3148 


AGGCCUCU G CCUCAUUG 


2016 


CAATGAGG GGCTAGCTACAACGA AGAGGCCT 


4344 


3153 


UCUGCCUC A UUGGUGGU 


2017 


ACCACCAA GGCTAGCTACAACGA GAGGCAGA 


4345 


3157 


CCUCAUUG G UGGUCGGG 


2018 


CCCGACCA GGCTAGCTACAACGA CAATGAGG 


4346 


3160 


CAUUGGUG G UCGGGUAG 


2019 


CTACCCGA GGCTAGCTACAACGA CACCAATG 


4347 


3165 


GUGGUCGG G UAGCGGCC 


2020 


GGCCGCTA GGCTAGCTACAACGA CCGACCAC 


4348 


3168 


GUCGGGUA G CGGCCAGU 


2021 


ACTGGCCG GGCTAGCTACAACGA TACCCGAC 


4349 


3171 


GGGUAGCG G CCAGUAGG 


2022 


CCTACTGG GGCTAGCTACAACGA CGCTACCC 


4350 


3175 


AGCGGCCA G UAGGGCGU 


2023 


ACGCCCTA GGCTAGCTACAACGA TGGCCGCT 


4351 


3180 


CCAGUAGG G CGUGGGAG 


2024 


CTCCCACG GGCTAGCTACAACGA CCTACTGG 


4352 


3182 


AGUAGGGC G UGGGAGCC 


2025 


GGCTCCCA GGCTAGCTACAACGA GCCCTACT 


4353 


3188 


GCGUGGGA G CCUGGCCA 


2026 


TGGCCAGG GGCTAGCTACAACGA TCCCACGC 


4354 


3193 


GGAGCCUG G CCAUCCCU 


2027 


AGGGATGG GGCTAGCTACAACGA CAGGCTCC 


4355 


3196 


GCCUGGCC A UCCCUGCC 


2028 


GGCAGGGA GGCTAGCTACAACGA GGCCAGGC 


4356 


3202 


CCAUCCCU G CCUCCUGG 


2029 


CCAGGAGG GGCTAGCTACAACGA AGGGATGG 


4357 


3212 


CUCCUGGA G UGGACGAG 


2030 


CTCGTCCA GGCTAGCTACAACGA TCCAGGAG 


4358 


3216 


UGGAGUGG A CGAGGUUG 


2031 


CAACCTCG GGCTAGCTACAACGA CCACTCCA 


4359 


3221 


UGGACGAG G UUGGCAGC 


2032 


GCTGCCAA GGCTAGCTACAACGA CTCGTCCA 


4360 


3225 


CGAGGUUG G CAGCUGGU 


2033 


ACCAGCTG GGCTAGCTACAACGA CAACCTCG 


4361 


3228 


GGUUGGCA G CUGGUCCG 


2034 


CGGACCAG GGCTAGCTACAACGA TGCCAACC 


4362 


3232 


GGCAGCUG G UCCGUCUG 


2035 


CAGACGGA GGCTAGCTACAACGA CAGCTGCC 


4363 


3236 


GCUGGUCC G UCUGCUCC 


2036 


GGAGCAGA GGCTAGCTACAACGA GGACCAGC 


4364 


3240 


GUCCGUCU G CUCCUGCC 


2037 


GGCAGGAG GGCTAGCTACAACGA AGACGGAC 


4365 


3246 


CUGCUCCU G CCCCACUC 


2038 


GAGTGGGG GGCTAGCTACAACGA AGGAGCAG 


4366 


3251 


CCUGCCCC A CUCUCCCC 


2039 


GGGGAGAG GGCTAGCTACAACGA GGGGCAGG 


4367 


3261 


UCUCCCCC G CCCCUGCC 


2040 


GGCAGGGG GGCTAGCTACAACGA GGGGGAGA 


4368 


3267 


CCGCCCCU G CCCUCACC 


2041 


GGTGAGGG GGCTAGCTACAACGA AGGGGCGG 


4369 


3273 


CUGCCCUC A CCCUACCC 


2042 


GGGTAGGG GGCTAGCTACAACGA GAGGGCAG 


4370 


3278 


CUCACCCU A CCCUUGCC 


2043 


GGCAAGGG GGCTAGCTACAACGA AGGGTGAG 


4371 


3284 


CUACCCUU G CCCCACGC 


2044 


GCGTGGGG GGCTAGCTACAACGA AAGGGTAG 


4372 


3289 


CUUGCCCC A CGCCUGCC 


2045 


GGCAGGCG GGCTAGCTACAACGA GGGGCAAG 


4373 
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3291 


UGCCCCAC G CCUGCCUC 


2046 


GAGGCAGG GGCTAGCTACAACGA GTGGGGCA 


4374 


3295 


CCACGCCU G CCUCAUGG 


2047 


CCATGAGG GGCTAGCTACAACGA AGGCGTGG 


4375 


3300 


CCUGCCUC A UGGCUGGU 


2048 


ACCAGCCA GGCTAGCTACAACGA GAGGCAGG 


4376 


3303 


GCCUCAUG G CUGGUUGC 


2049 


GCAACCAG GGCTAGCTACAACGA CATGAGGC 


4377 


3307 


CAUGGCUG G UUGCUCUU 


2050 


AAGAGCAA GGCTAGCTACAACGA CAGCCATG 


4378 


3310 


GGCUGGUU G CUCUUGGA 


2051 


TCCAAGAG GGCTAGCTACAACGA AACCAGCC 


4379 


3319 


CUCUUGGA G CCUGGUAG 


2052 


CTACCAGG GGCTAGCTACAACGA TCCAAGAG 


4380 


3324 


GGAGCCUG G UAGUGUCA 


2053 


TGACACTA GGCTAGCTACAACGA CAGGCTCC 


4381 


3327 


GCCUGGUA G UGUCACUG 


2054 


CAGTGACA GGCTAGCTACAACGA TACCAGGC 


4382 


3329 


CUGGUAGU G UCACUGGC 


2055 


GCCAGTGA GGCTAGCTACAACGA ACTACCAG 


4383 


3332 


GUAGUGUC A CUGGCUCA 


2056 


TGAGCCAG GGCTAGCTACAACGA GACACTAC 


4384 


3336 


UGUCACUG G CUCAGCCU 


2057 


AGGCTGAG GGCTAGCTACAACGA CAGTGACA 


4385 


3341 


CUGGCUCA G CCUUGCUG 


2058 


CAGCAAGG GGCTAGCTACAACGA TGAGCCAG 


4386 


3346 


UCAGCCUU G CUGGGUAU 


2059 


ATACCCAG GGCTAGCTACAACGA AAGGCTGA 


4387 


3351 


CUUGCUGG G UAUACACA 


2060 


TGTGTATA GGCTAGCTACAACGA CCAGCAAG 


4388 


3353 


UGCUGGGU A UACACAGG 


2061 


CCTGTGTA GGCTAGCTACAACGA ACCCAGCA 


4389 


3355 


CUGGGUAU A CACAGGCU 


2062 


AGCCTGTG GGCTAGCTACAACGA ATACCCAG 


4390 


3357 


GGGUAUAC A CAGGCUCU 


2063 


AGAGCCTG GGCTAGCTACAACGA GTATACCC 


4391 


3361 


AUACACAG G CUCUGCCA 


2064 


TGGCAGAG GGCTAGCTACAACGA CTGTGTAT 


4392 


3366 


CAGGCUCU G CCACCCAC 


2065 


GTGGGTGG GGCTAGCTACAACGA AGAGCCTG 


4393 


3369 


GCUCUGCC A CCCACUCU 


2066 


AGAGTGGG GGCTAGCTACAACGA GGCAGAGC 


4394 


3373 


UGCCACCC A CUCUGCUC 


2067 


GAGCAGAG GGCTAGCTACAACGA GGGTGGCA 


4395 


3378 


CCCACUCU G CUCCAAGG 


2068 


CCTTGGAG GGCTAGCTACAACGA AGAGTGGG 


4396 


3388 


UCCAAGGG G CUUGCCCU 


2069 


AGGGCAAG GGCTAGCTACAACGA CCCTTGGA 


4397 


3392 


AGGGGCUU G CCCUGCCU 


2070 


AGGCAGGG GGCTAGCTACAACGA AAGCCCCT 


4398 


3397 


CUUGCCCU G CCUUGGGC 


2071 


GCCCAAGG GGCTAGCTACAACGA AGGGCAAG 


4399 


3404 


UGCCUUGG G CCAAGUUC 


2072 


GAACTTGG GGCTAGCTACAACGA CCAAGGCA 


4400 


3409 


UGGGCCAA G UUCUAGGU 


2073 


ACCTAGAA GGCTAGCTACAACGA TTGGCCCA 


4401 


3416 


AGUUCUAG G UCUGGCCA 


2074 


TGGCCAGA GGCTAGCTACAACGA CTAGAACT 


4402 


3421 


UAGGUCUG G CCACAGCC 


2075 


GGCTGTGG GGCTAGCTACAACGA CAGACCTA 


4403 


3424 


GUCUGGCC A CAGCCACA 


2076 


TGTGGCTG GGCTAGCTACAACGA GGCCAGAC 


4404 


3427 


UGGCCACA G CCACAGAC 


2077 


GTCTGTGG GGCTAGCTACAACGA TGTGGCCA 


4405 


3430 ' 


CCACAGCC A CAGACAGC 


2078 


GCTGTCTG GGCTAGCTACAACGA GGCTGTGG 


4406 


3434 


AGCCACAG A CAGCUCAG 


2079 


CTGAGCTG GGCTAGCTACAACGA CTGTGGCT 


4407 


3437 


CACAGACA G CUCAGUCC 


2080 


GGACTGAG GGCTAGCTACAACGA TGTCTGTG 


4408 


3442 


ACAGCUCA G UCCCCUGU 


2081 


ACAGGGGA GGCTAGCTACAACGA TGAGCTGT 


4409 


3449 


AGUCCCCU G UGUGGUCA 


2082 


TGACCACA GGCTAGCTACAACGA AGGGGACT 


4410 


3451 


UCCCCUGU G UGGUCAUC 


2083 


GATGACCA GGCTAGCTACAACGA ACAGGGGA 


4411 


3454 


CCUGUGUG G UCAUCCUG 


2084 


CAGGATGA GGCTAGCTACAACGA CACACAGG 


4412 


3457 


GUGUGGUC A UCCUGGCU 


2085 


AGCCAGGA GGCTAGCTACAACGA GACCACAC 


4413 


3463 


UCAUCCUG G CUUCUGCU 


2086 


AGCAGAAG GGCTAGCTACAACGA CAGGATGA 


4414 


3469 


UGGCUUCU G CUGGGGGC 


2087 


GCCCCCAG GGCTAGCTACAACGA AGAAGCCA 


4415 


3476 


UGCUGGGG G CCCACAGC 


2088 


GCTGTGGG GGCTAGCTACAACGA CCCCAGCA 


4416 


3480 


GGGGGCCC A CAGCGCCC 


2089 


GGGCGCTG GGCTAGCTACAACGA GGGCCCCC 


4417 


3483 


GGCCCACA G CGCCCCUG 


2090 


CAGGGGCG GGCTAGCTACAACGA TGTGGGCC 


4418 


3485 


CCCACAGC G CCCCUGGU 


2091 


ACCAGGGG GGCTAGCTACAACGA GCTGTGGG 


4419 


3492 


CGCCCCUG G UGCCCCUC 


2092 


GAGGGGCA GGCTAGCTACAACGA CAGGGGCG 


4420 


3494 


CCCCUGGU G CCCCUCCC 


2093 


GGGAGGGG GGCTAGCTACAACGA ACCAGGGG 


4421 


3511 


CUCCCAGG G CCCGGGUU 


2094 


AACCCGGG GGCTAGCTACAACGA CCTGGGAG 


4422 


3517 


GGGCCCGG G UUGAGGCU 


2095 


AGCCTCAA GGCTAGCTACAACGA CCGGGCCC 


4423 


3523 


GGGUUGAG G CUGGGCCA 


2096 


TGGCCCAG GGCTAGCTACAACGA CTCAACCC 


4424 


3528 


GAGGCUGG G CCAGGCCC 


2097 


GGGCCTGG GGCTAGCTACAACGA CCAGCCTC 


4425 
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3533 


UGGGCCAG G CCCUCUGG 


2098 


CCAGAGGG GGCTAGCTACAACGA CTGGCCCA 


4426 


3543 


CCUCUGGG A CGGGGACU 


2099 


AGTCCCCG GGCTAGCTACAACGA CCCAGAGG 


4427 


3549 


GGACGGGG A CUUGUGCC 


2100 


GGCACAAG GGCTAGCTACAACGA CCCCGTCC 


4428 


3553 


GGGGACUU G UGCCCUGU 


2101 


ACAGGGCA GGCTAGCTACAACGA AAGTCCCC 


4429 


3555 


GGACUUGU G CCCUGUCA 


2102 


TGACAGGG GGCTAGCTACAACGA ACAAGTCC 


4430 


3560 


UGUGCCCU G UCAGGGUU 


2103 


AACCCTGA GGCTAGCTACAACGA AGGGCACA 


4431 


3566 


CUGUCAGG G UUCCCUAU 


2104 


ATAGGGAA GGCTAGCTACAACGA CCTGACAG 


4432 


3573 


GGUUCCCU A UCCCUGAG 


2105 


CTCAGGGA GGCTAGCTACAACGA AGGGAACC 


4433 


3582 


UCCCUGAG G UUGGGGGA 


2106 


TCCCCCAA GGCTAGCTACAACGA CTCAGGGA 


4434 


3593 


GGGGGAGA G CUAGCAGG 


2107 


CCTGCTAG GGCTAGCTACAACGA TCTCCCCC 


4435 


3597 


GAGAGCUA G CAGGGCAU 


2108 


ATGCCCTG GGCTAGCTACAACGA TAGCTCTC 


4436 


3602 


CUAGCAGG G CAUGCCGC 


2109 


GCGGCATG GGCTAGCTACAACGA CCTGCTAG 


4437 


3604 


AGCAGGGC A UGCCGCUG 


2110 


CAGCGGCA GGCTAGCTACAACGA GCCCTGCT 


4438 


3606 


CAGGGCAU G CCGCUGGC 


2111 


GCCAGCGG GGCTAGCTACAACGA ATGCCCTG 


4439 


3609 


GGCAUGCC G CUGGCUGG 


2112 


CCAGCCAG GGCTAGCTACAACGA GGCATGCC 


4440 


3613 


UGCCGCUG G CUGGCCAG 


2113 


CTGGCCAG GGCTAGCTACAACGA CAGCGGCA 


4441 


3617 


GCUGGCUG G CCAGGGCU 


2114 


AGCCCTGG GGCTAGCTACAACGA CAGCCAGC 


4442 


3623 


UGGCCAGG G CUGCAGGG 


2115 


CCCTGCAG GGCTAGCTACAACGA CCTGGCCA 


4443 


3626 


CCAGGGCU G CAGGGACA 


2116 


TGTCCCTG GGCTAGCTACAACGA AGCCCTGG 


4444 


3632 


CUGCAGGG A CACUCCCC 


2117 


GGGGAGTG GGCTAGCTACAACGA CCCTGCAG 


4445 


3634 


GCAGGGAC A CUCCCCCU 


2118 


AGGGGGAG GGCTAGCTACAACGA GTCCCTGC 


4446 


3646 


CCCCUUUU G UCCAGGGA 


2119 


TCCCTGGA GGCTAGCTACAACGA AAAAGGGG 


4447 


3655 


UCCAGGGA A UACCACAC 


2120 


GTGTGGTA GGCTAGCTACAACGA TCCCTGGA 


4448 


3657 


CAGGGAAU A CCACACUC 


2121 


GAGTGTGG GGCTAGCTACAACGA ATTCCCTG 


4449 


3660 


GGAAUACC A CACUCGCC 


2122 


GGCGAGTG GGCTAGCTACAACGA GGTATTCC 


4450 


3662 


AAUACCAC A CUCGCCCU 


2123 


AGGGCGAG GGCTAGCTACAACGA GTGGTATT 


4451 


3666 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


3679 


UCUCUCCA G CGAACACC 


2125 


GGTGTTCG GGCTAGCTACAACGA TGGAGAGA 


4453 


3683 


UCCAGCGA A CACCACAC 


2126 


GTGTGGTG GGCTAGCTACAACGA TCGCTGGA 


4454 


3685 


CAGCGAAC A CCACACUC 


2127 


GAGTGTGG GGCTAGCTACAACGA GTTCGCTG 


4455 


3688 


CGAACACC A CACUCGCC 


2128 


GGCGAGTG GGCTAGCTACAACGA GGTGTTCG 


4456 


3690 


AACACCAC A CUCGCCCU 


2129 


AGGGCGAG GGCTAGCTACAACGA GTGGTGTT 


4457 


3694 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


3711 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3713 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3716 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


3718 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


3730 


CCCCUUCU G UCCAGGGG 


2134 


CCCCTGGA GGCTAGCTACAACGA AGAAGGGG 


4462 


3739 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3741 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3744 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


3746 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


3767 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3769 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3772 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


3774 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


3778 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


3795 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3797 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3800 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


3802 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


3806 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 
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3823 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3825 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3828 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


3830 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


3834 


CCACACUC G CCCUUCUG 


2137 


CAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4465 


3842 


GCCCUUCU G UCCAGGGG 


2138 


CCCCTGGA GGCTAGCTACAACGA AGAAGGGC 


4466 


3851 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3853 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3856 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


3858 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


3862 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


3879 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3881 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3884 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


3886 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


3890 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


3907 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3909 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3912 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


3914 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


3926 


CCCCUUCU G UCCAGGGG 


2134 


CCCCTGGA GGCTAGCTACAACGA AGAAGGGG 


4462 


3935 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3937 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3940 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


3942 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


3963 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3965 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


3968 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


3970 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


3991 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


3993 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGC TAC AACGA GTCCCCTG 


4459 


3996 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


3998 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4002 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4019 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4021 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4024 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


4026 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4038 


CCCCUUCU G UCCAGGGG 


2134 


CCCCTGGA GGCTAGCTACAACGA AGAAGGGG 


4462 


4047 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4049 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4052 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


4054 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4058 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4075 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4077 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4080 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


4082 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4086 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4103 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4105 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4108 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 
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4110 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4131 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4133 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4136 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


4138 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4159 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4161 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4164 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


4166 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4178 


CCCCUUCU G UCCAGGGG 


2134 


CCCCTGGA GGCTAGCTACAACGA AGAAGGGG 


4462 


4187 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4189 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4192 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


4194 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4198 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4215 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4217 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4220 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


4222 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4243 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4245 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4248 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


4250 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4271 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4273 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4276 


GGGACGCC A CACUCCCC 


2132 


GGGGAGTG GGCTAGCTACAACGA GGCGTCCC 


4460 


4278 


GACGCCAC A CUCCCCCU 


2133 


AGGGGGAG GGCTAGCTACAACGA GTGGCGTC 


4461 


4290 


CCCCUUCU G UCCAGGGG 


2134 


CCCCTGGA GGCTAGCTACAACGA AGAAGGGG 


4462 


4299 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4301 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4304 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


4306 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4310 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4327 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4329 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4332 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


4334 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4338 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4355 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4357 


CAGGGGAC G CCACACUC 


2131 


GAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4459 


4360 


GGGACGCC A CACUCGCC 


2135 


GGCGAGTG GGCTAGCTACAACGA GGCGTCCC 


4463 


4362 


GACGCCAC A CUCGCCCU 


2136 


AGGGCGAG GGCTAGCTACAACGA GTGGCGTC 


4464 


4366 


CCACACUC G CCCUUCUC 


2124 


GAGAAGGG GGCTAGCTACAACGA GAGTGTGG 


4452 


4383 


UCCAGGGG A CGCCACAC 


2130 


GTGTGGCG GGCTAGCTACAACGA CCCCTGGA 


4458 


4385 


CAGGGGAC G CCACACUU 


2139 


AAGTGTGG GGCTAGCTACAACGA GTCCCCTG 


4467 


4388 


GGGACGCC A CACUUGCC 


2140 


GGCAAGTG GGCTAGCTACAACGA GGCGTCCC 


4468 


4390 


GACGCCAC A CUUGCCCU 


2141 


AGGGCAAG GGCTAGCTACAACGA GTGGCGTC 


4469 


4394 


CCACACUU G CCCUUCUG 


2142 


CAGAAGGG GGCTAGCTACAACGA AAGTGTGG 


4470 


4402 


GCCCUUCU G UCCAGGGA 


2143 


TCCCTGGA GGCTAGCTACAACGA AGAAGGGC 


4471 


4411 


UCCAGGGA A UGCCACAC 


2144 


GTGTGGCA GGCTAGCTACAACGA TCCCTGGA 


4472 


4413 


CAGGGAAU G CCACACUC 


2145 


GAGTGTGG GGCTAGCTACAACGA ATTCCCTG 


4449 


4416 


GGAAUGCC A CACUCCCC 


2146 


GGGGAGTG GGCTAGCTACAACGA GGCATTCC 


4473 
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4418 


aattpppap a pttppppptt 


2147 


AOOOOOAp PPPTAPPTAPA AOOA PTPPP1VTT 


4474 


" ^ >} J 


ttpttppppa p papppttpp 

UV-UV^V^V-V^tt. O l^rtoV-.v- u V-V_ 


2148 


pp AopPTn ppptzipptapa appa r vr > rT i r t 'h{?,& 


4475 


44^ ft 


ppppanpa p ppttpppap 

V_v„b.V_i*\0\ rt o Lb- Ub.L.O.H.0 


2149 


c T vr , nms.nn ooot'aoot'apa aooa tootoppo 

\- 1 t-LiLjAtjtj Vj»\jL-1AIjL.1ALAACIjA Hjv^HjLjuo 


4476 


■Tt O 


pppitpppa n ttpappapp 

OV».HJl_V-oA o UoAl_v-AoV- 


2150 


ppTPPTpa PooTaooTAPa aooa r nr , r , r , 7^r , r , r> 


4477 


4449 


TTPPPAPTTP a PPA PPT TT TP 


2151 


^ , '&'hH^ l 'v^ , r , ooptaootapa appa r , Ts.r v vHHr , Ti 


4478 


4453 


APTTPAPPA P PTTTTPPPPA 
■HOUOAV—V-A O LUULLLLii 


2152 


r vr , r , Hr'7\ ap prrTTvpfTA p a n r<p a tppt^papt 1 


4479 


4461 


PPIJUPPPP A TIPPAIJAPA 


2153 


TPTATPPA PPPTAPPTAPAAPPA PPPPAAPP 


4480 


*± *± D 3 


PPPPAT7PP A TTAPAPTTTTP 

b.V-V-V-AUV».0 A iOi^V— UUL 


2154 


PAAP-TPTA PPPTAPPT'APA APOA PP1TPPPP 


4481 


44fi9 


ATTPPATTAP A PTTTTPPPPA 


2155 


TPPPPAAP PPPTAOPTfiPA APPA PTBTPPBT 


4482 


447 Q 
**** / j? 


TTTTPPPPAP P PPAPPAPP 
UUUVyV»bAo o V^V-AOOAOV— 


2156 


PPTPPTPP PPPT" APPTAPA APPA PT'PPPPAA 
VaV3L.lA0L.iAV-AAL.LjA L. 1 L.VjLjLtAA 


4483 


44 ft fi 


pppp app a. p PPPTTPTTAP 
OOV-.V-AOOA O UUwU^>Unu 


2157 


PT'AO AOOO. OO OT" A P PT A P A A OO A TTTTtTTTi 
V».XMVjAL»LiLx OV3L.lAkjL.XALAAL.V5A IL.L.I00L.L 


4484 


44 Qfi 


PPTTPTTAPP P PTTPPPPPP 

V-V-UV-UAOO O V^UOV-VASOO 


2158 


C^r^CCC C'Tt.C* PPP r PAPPT l AP A AOP A PPT^APAPP 
V-LLbbUnb bub 1 AoL. 1 AL AAL-oA bblAbAub 


4485 


44 Q Q 


priziPPPPTT p nnnr^niirT 1 

V»UAoOOL.U O V-V-OOOUoL. 


2159 


p o a cccr^r^ ppptti ppt7> p a a PO A BPPPPTJIP 
LxL.AL.L.L.00 ooL.lAoL.lAL,AAL.GA AGL.L.L. i AG 


4486 


4Rft4 


PPTTPPPnn P 7TPPPAPPP 
OV-UOV-V-oo o UoLUiLLL 


2160 


p ppfrpp O A OOOTAOOTAP'A A OO A r^r^r i r > ( n, T i tr % r > 
ooolooL-A obLlAuL 1 ALAALbA LLbbLAbL 


4487 


ijUO 


ttpppp./"2ptt p pp a pppttp 

UOL.V-.OOOU o V-V-AV_UV^Uo 


2161 


/~i7\/~i/~i/~«rp/-i/-i OOOT" A OOT 1 A P A A OO A 71 f^r^C^r^r* C^'ft 

LAbbblbb ooL, X Abb X ALAALbA AL.LL.GGLA 


4488 




VJ000U0L.L. A ULtUuuLU 


2162 


7\/-i/~i/~i7\/^ipt/* K i OOOT>AOOT^A P A A OP A OPP7* f~*r*C*r* 

AGL.LAGGG GGL.1AGC1ALAACGA GGCACCCG 


4489 


/l ci c 


oo app ot to o ot top't tt too 
V-L-AL.L.L.UO Li CUL.L.UUCC 


2163 


GGAAGGAG GGCTAGCTACAACGA CAGGGTGG 


4490 




OT TOOT TF TOO A OAOPPTTPP 

GUL.L.UUL.L. A CACLGUGC 


2164 


GCACGGTG GGCTAGCTACAACGA GGAAGGAG 


4491 




PPTTTTPPAP A PPPTTPPfTP 
LLUULLAt A L.U0U0L.U0 


2165 


LAbLALbb oGLlAGLl AL-AACGA GTGGAAGG 


4492 


^ J6J 


TTPPAPAPP P TTPPTTPPT TP 
UV-V^AV»,AV».V- o UbUUbuUL 


2166 


PAPPAPPA OPOTA OOT" A O A A OP A PPTPTPPR 

bAL LAo bA oGU1AGL.1AL.AAL.oA GG1G1GGA 


4493 




papappptt o pttppttpap 
v»AuAL.b.oU o L.U00ULAL. 


2167 


OT>OAOOAO PPPTTi /~>0<T«7V 07\ 7V /~i^»7v t\ n/^nm/Trn 

GXGAGGAG GGC xAGCTALAACGA ACGGTGTG 


4494 


i J J j 


OOOT TO OT TO O T TO A OT TO OO 

L.L.0U0L.UG o UL,AL.UoL.L, 


2168 


fiOOAPTPA OOOrpA OOT»7\ P A 7\ OOTV 07\007\ OOz~< 

GGLAG1GA GGCTAGCTACAACGA CAGCACGG 


4495 




TTPPTTPPTTP A OT TO P OT TO O 
UoUUb/oUL A V-UoLV-UoL- 


2169 


POAOOOAO PPPT71 OOT" A PA A 007\ P7\PPJPPT\ 

GLAGGCAG GGC x AGC 1 ALAACGA GACCAGCA 


4496 




TTOOTTOAOTT O OOTTOOITOO 
UooULAUU 0 V^V-UoV,Uoo 


2170 


OOAP'P'AOO OOOT 1 A OPT 1 A OA A OOT\ T\ OT»07\ OOtv 

v-CAoLAoLj GGC 1 AGC x AGAALGA AG1GACCA 


4497 


4 R4 c: 


PAPTTPPPTT P PIIPPPPPP 


2171 


OOOOO PA O OOOT 1 A PPT 1 A PA A OO A AOOOAOT/™! 

GCCCCCAG GGClAGCxACAACGA AGGCAGTG 


4498 


4CC9 


TTPPT7PPPP (2 PPTTP7APATT 
UOLUOO00 Lj CoUV^AoAU 


2172 


AT^PT'PAPP OOOTA OOT 1 A OA A OO A /-i0007\/"" , 07\ 

A1C1GACG GGCTAGCTACAACGA CCCCAGCA 


4499 


4 R ^4 


PTTPPPPPP P TTPAPATTPP 
LUUUUUUL \J UV*_AOAUOV- 


2173 


PPAT'PTP A PPPTAPPTAPAAPPA r*t~T'f~TT'7ir' 
LiL.AXL.XoA OLiL.IALjL.1 Av-AAL.LjA OL.L.L.L.LAG 


4500 


4 R q q 


PPPPTTPAP A TTPPBPP7TP 
OOV-OUl»~rt.O A U0CA00U0 


2174 


r , nr % r* r pr , r , n Pootaootaoa aooa r^nr* a ooo/^ 
LALblbLA oGGlAGGl AGAACGA ClGACGCC 


4501 




OOTTOAO ATT O OAOOTTOAO 

V-0UL.A0AU 0 GA00U0AL, 


2175 


GTCACCTG GGCTAGCTACAACGA ATCTGACG 


4502 


deer 
*± J O D 


aoattppap n iipapppttp 

AOAUOV-Ao 0 UOAV-L.V-UO 


2176 


PAPPPTP7V P"<OOrp7\ OOT»7\ OTV 7\ oriTi /~irn/~t /~i 7\ m/-wn 

LAubblLA oGG 1 AGL. X AGAACGA C1GCAICT 


4503 




TTOOAOOTTO A OOOT TOT TOO 

U0LA00U0 A L-uGUGUGG 


2177 


GCACAGGG GGCTAGCTACAACGA CACCTGCA 


4504 


ART! 


OTTOAOOOTT O TTPPAPPAO 

LjUvjALI-GU 0 UuvAubAb 


2178 


CTCCTGCA GGCTAGCTACAACGA AGGGTCAC 


4505 




OAOOOITOTT O OAOOAOOTT 

GAGLLUGU G LAGGAGGU 


2179 


ACCTCCTG GGCTAGCTACAACGA ACAGGGTC 


4506 




TTOOAOOAO O TTATTOTTOTTO 

U0GAG0AG G UAULULUG 


2180 


CAGAGATA GGCTAGCTACAACGA CTCCTGCA 


4507 


ACQ/1 


OAOOAOOTT A T TOT TOT TOO 7V 

LAvabrAbrGU A UGUGUGGA 


2181 


TCCAGAGA GGCTAGCTACAACGA ACCTCCTG 


4508 


4592 


AUCUCUGG A CCUGCCUC 


2182 


GAGGCAGG GGCTAGCTACAACGA CCAGAGAT 


4509 


4596 


CUC5GACCU G CCUCUUGG 


2183 


CCAAGAGG GGCTAGCTACAACGA AGGTCCAG 


4510 


4604 


OOOTTOTTTT/^ /—1 tTmvTTTTH rtn 

UL.CULUUG G UCAUUACG 


2184 


CGTAATGA GGCTAGCTACAACGA CAAGAGGC 


4511 




TTOTTTTOOTTO A T TT 7 A OOOOO 

UL.UUbLiUL A UUACGGGG 


2185 


CCCCGTAA GGCTAGCTACAACGA GACCAAGA 


4512 


4610 


ttootto7\ t ttt t\ nnnr»r<rir m 

UGGUCAUU A CGGGGCUG 


2186 


CAGCCCCG GGCTAGCTACAACGA AATGACCA 


4513 


4615 


7\ ttt T7\ 00 /~i r\ nTTncinrmn 

AUUACGGG G CUGGGCAG 


2187 


CTGCCCAG GGCTAGCTACAACGA CCCGTAAT 


4514 


4620 


GGGGCVGG G CAGGGCCU 


2188 


AGGCCCTG GGCTAGCTACAACGA CCAGCCCC 


4515 


4625 


TTPPPPAPP P PPI TPPTT2XTT 


2189 


ATAOPAPP OOOrPA OOT^A OA A /"i/^'A r%rwT*(~*r%r*/Ti\. 

AlALLAbb vaoL-XAGGl ALAACGA CC1GCCCA 


4516 


4630 


AGGGCCUG G UAUCAGGG 


2190 


CCCTGATA GGCTAGCTACAACGA CAGGCCCT 


4517 


4632 


GGCCUGGU A UCAGGGCC 


2191 


GGCCCTGA GGCTAGCTACAACGA ACCAGGCC 


4518 


4638 


GUAUCAGG G CCCCGCUG 


2192 


CAGCGGGG GGCTAGCTACAACGA CCTGATAC 


4519 


4643 


AGGGCCCC G CUGGGGUU 


2193 


AACCCCAG GGCTAGCTACAACGA GGGGCCCT 


4520 


4649 


CCGCUGGG G UUGCAGGG 


2194 


CCCTGCAA GGCTAGCTACAACGA CCCAGCGG 


4521 


4652 


CUGGGGUU G CAGGGCUG 


2195 


CAGCCCTG GGCTAGCTACAACGA AACCCCAG 


4522 


4657 


GUUGCAGG G CUGGGCCU 


2196 


AGGCCCAG GGCTAGCTACAACGA CCTGCAAC 


4523 


4662 


AGGGCUGG G CCUGUGCU 


2197 


AGCACAGG GGCTAGCTACAACGA CCAGCCCT 


4524 


4666 


CUGGGCCU G UGCUGUGG 


2198 


CCACAGCA GGCTAGCTACAACGA AGGCCCAG 


4525 
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4668 


GGGCCUGU G CUGUGGUC 


2199 


GACCACAG GGCT AG CTACAACGA ACAGGCCC 


4526 


4671 


CCUGUGCU G UGGUCCUG 


2200 


CAGGACCA GGCTAGCTACAACGA AGCACAGG 


4527 


4674 


GUGCUGUG G UCCUGGGG 


2201 


CCCCAGGA GGCTAGCTACAACGA CACAGCAC 


4528 


4682 


GUCCUGGG G UGUCCAGG 


2202 


CCTGGACA GGCTAGCTACAACGA CCCAGGAC 


4529 


4684 


CCUGGGGU G UCCAGGAC 


2203 


GTCCTGGA GGCTAGCTACAACGA ACCCCAGG 


4530 


4691 


UGUCCAGG A CAGACGUG 


2204 


CACGTCTG GGCTAGCTACAACGA CCTGGACA 


4531 


4695 


CAGGACAG A CGUGGAGG 


2205 


CCTCCACG GGCTAGCTACAACGA CTGTCCTG 


4532 


4697 


GGACAGAC G UGGAGGGG 


2206 


CCCCTCCA GGCTAGCTACAACGA GTCTGTCC 


4533 


4705 


GUGGAGGG G UCAGGGCC 


2207 


GGCCCTGA GGCTAGCTACAACGA CCCTCCAC 


f 4534 


4711 


GGGUCAGG G CCCAGCAC 


2208 


GTGCTGGG GGCTAGCTACAACGA CCTGACCC 


4535 


4716 


AGGGCCCA G CACCCCUG 


2209 


CAGGGGTG GGCTAGCTACAACGA TGGGCCCT 


4536 


4718 


GGCCCAGC A CCCCUGCU 


2210 


AGCAGGGG GGCTAGCTACAACGA GCTGGGCC 


4537 


4724 


GCACCCCU G CUCCAUGC 


2211 


GCATGGAG GGCTAGCTACAACGA AGGGGTGC 


4538 


4729 


CCUGCUCC A UGCUGAAC 


2212 


GTTCAGCA GGCTAGCTACAACGA GGAGCAGG 


4539 


4731 


UGCUCCAU G CUGAACUG 


2213 


CAGTTCAG GGCTAGCTACAACGA ATGGAGCA 


4540 


4736 


CAUGCUGA A CUGUGGGA 


2214 


TCCCACAG GGCTAGCTACAACGA TCAGCATG 


4541 


4739 


GCUGAACU G UGGGAAGC 


2215 


GCTTCCCA GGCTAGCTACAACGA AGTTCAGC 


4542 


4746 


UGUGGGAA G CAUCCAGG 


2216 


CCTGGATG GGCTAGCTACAACGA TTCCCACA 


4543 


4748 


UGGGAAGC A UCCAGGUC 


2217 


GACCTGGA GGCTAGCTACAACGA GCTTCCCA 


4544 


4754 


GCAUCCAG G UCCCUGGG 


2218 


CCCAGGGA GGCTAGCTACAACGA CTGGATGC 


4545 


4762 


GUCCCUGG G UGGCUUCA 


2219 


TGAAGCCA GGCTAGCTACAACGA CCAGGGAC 


4546 


4765 


CCUGGGUG G CUUCAACA 


2220 


TGTTGAAG GGCTAGCTACAACGA CACCCAGG 


4547 


4771 


UGGCUUCA A CAGGAGUU 


2221 


AACTCCTG GGCTAGCTACAACGA TGAAGCCA 


4548 


4777 


CAACAGGA G UUCCAGCA 


2222 


TGCTGGAA GGCTAGCTACAACGA TCCTGTTG 


4549 


4783 


GAGUUCCA G CACGGGAA 


2223 


TTCCCGTG GGCTAGCTACAACGA TGGAACTC 


4550 


4785 


GUUCCAGC A CGGGAACC 


2224 


GGTTCCCG GGCTAGCTACAACGA GCTGGAAC 


4551 


4791 


GCACGGGA A CCACUGGA 


2225 


TCCAGTGG GGCTAGCTACAACGA TCCCGTGC 


4552 


4794 


CGGGAACC A CUGGACAA 


2226 


TTGTCCAG GGCTAGCTACAACGA GGTTCCCG 


4553 


4799 


ACCACUGG A CAACCUGG 


2227 


CCAGGTTG GGCTAGCTACAACGA CCAGTGGT 


4554 


4802 


ACUGGACA A CCUGGGGU 


2228 


ACCCCAGG GGCTAGCTACAACGA TGTCCAGT 


4555 


4809 


AACCUGGG G UGUGUCCU 


2229 


AGGACACA GGCTAGCTACAACGA CCCAGGTT 


4556 


4811 


CCUGGGGU G UGUCCUGA 


2230 


TCAGGACA GGCTAGCTACAACGA ACCCCAGG 


4557 


4813 


UGGGGUGU G UCCUGAUC 


2231 


GATCAGGA GGCTAGCTACAACGA ACACCCCA 


4558 


4819 


GUGUCCUG A UCUGGGGA 


2232 


• TCCCCAGA GGCTAGCTACAACGA CAGGACAC 


4559 


4827 


AUCUGGGG A CAGGCCAG 


2233 


CTGGCCTG GGCTAGCTACAACGA CCCCAGAT 


4560 


4831 


GGGGACAG G CCAGCCAC 


2234 


GTGGCTGG GGCTAGCTACAACGA CTGTCCCC 


4561 


4835 


ACAGGCCA G CCACACCC 


2235 


GGGTGTGG GGCTAGCTACAACGA TGGCCTGT 


4562 


4838 


GGCCAGCC A CACCCCGA 


2236 


TCGGGGTG GGCTAGCTACAACGA GGCTGGCC 


4563 


4840 


CCAGCCAC A CCCCGAGU 


2237 


ACTCGGGG GG CTAGCTACAACG A GTGGCTGG 


4564 


4847 


CACCCCGA G UCCUAGGG 


2238 


CCCTAGGA GGCTAGCTACAACGA TCGGGGTG 


4565 


4856 


UCCUAGGG A CUCCAGAG 


2239 


CTCTGGAG GGCTAGCTACAACGA CCCTAGGA 


4566 


4866 


UCCAGAGA G CAGCCCAC 


2240 


GTGGGCTG GGCTAGCTACAACGA TCTCTGGA 


4567 


4869 


AGAGAGCA G CCCACUGC 


2241 


GCAGTGGG GGCTAGCTACAACGA TGCTCTCT 


4568 


4873 


AGCAGCCC A CUGCCCUG 


2242 


CAGGGCAG GGCTAGCTACAACGA GGGCTGCT 


4569 


4876 


AGCCCACU G CCCUGGGC 


2243 


GCCCAGGG GGCTAGCTACAACGA AGTGGGCT 


4570 


4883 


UGCCCUGG G CUCCACGG 


2244 


CCGTGGAG GGCTAGCTACAACGA CCAGGGCA 


4571 


4888 


UGGGCUCC A CGGAAGCC 


2245 


GGCTTCCG GGCTAGCTACAACGA GGAGCCCA 


4572 


4894 


CCACGGAA G CCCCCUCA 


2246 


TGAGGGGG GGCTAGCTACAACGA TTCCGTGG 


4573 


4902 


GCCCCCUC A UGCCGCUA 


2247 


TAGCGGCA GGCTAGCTACAACGA GAGGGGGC 


4574 


4904 


CCCCUCAU G CCGCUAGG 


2248 


CCTAGCGG GGCTAGCTACAACGA ATGAGGGG 


4575 


4907 


CUCAUGCC G CUAGGCCU 


2249 


AGGCCTAG GGCTAGCTACAACGA GGCATGAG 


4576 


4912 


GCCGCUAG G CCUUGGCC 


2250 


GGCCAAGG GGCTAGCTACAACGA CTAGCGGC \ 


4577 
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4918 


7\ P'P* pipit it tp> p 1 r*r~*nr~*r^f~*i r ~*r~* 
AGGCCUUG G CCUCGGGG 


2251 


pipipipipi AP'Pi piptpirpTV P , P ,r P7\ P 1 A A PIP 1 A Pitv APPPpT 

C LCCGAGG GGC J. AGC 1 ALAALbA CAAGGCC JL 


4578 


4927 


CCUCGGGG A CAGCCCAG 


2252 


p!rTV"«P»P«Pimpi /^/-t/~irp7\ j^»/-trp7> piTV 7\ p»pi 7\ pipipiP'Pi 7\pr 

C 1 GGG C J. G GG C 1 AGC 1 ACAACGA C C CCGAGG 


4579 


4930 


/^/^l/-i/-i^»7\ P17\ pi /"i /-"1/-1 TV piPTTA 

CGGGGACA G CCCAGCUA 


2253 


IAGC1GGG GGC 1 AGC 1 ACAACGA 1GJLCCCCG 


4580 


4935 


t\ pitv pi pi pipi a pi nrTR^nriPA 

ACAGCCCA G CUAGGCCA 


2254 


TPPPPTTiP PlpiPirp A PipTi A P» A A PV~i A TipiPlpi f^rri^irn 

XGGCCiAG GGC 1 AGC 1 ACAACGA 1GGGC1G1 


4581 


4940 


n/^H /~1 rf^lT 7 "TV PI Pi P1P171 OT 7PTTP 

CCAGCUAG G CCAGUGUG 


2255 


PTlPTiPTPP P<Plp""n A p/'liliTi pi A A P'P' A PTIVPPTPP 

CACACTGG GGCTAGCTACAACGA CTAGCTGG 


4582 


4 944 


CUAGGCCA G UGUGUGGC 


2256 


pipipiA nj\p3\ rPPTRPPTTiPTiRPPR r PpiPipiP""Ti A P< 

bLLALALA GGC i AGC 1 ALAALbA -LGGCC1AG 


4583 


4946 


AP1P 1 P 1 P 1 AP 1 TT /I "t T/— 1 T TP1 fipj\p 

AGGCCAGU G UGUGGCAvj 


2257 


P»n-»pipipiA p»A PPPTAP PTAPA A CC^ A liPT'PPPPT 1 

CJLGCCACA GGC XAoCl ACAACGA ACJ.GGCC1 


4584 


4948 


PPPAPTTPTT Pi T TP* P'P* A P'P' A 

GCCAGUGU G UGGCAGGA 


2258 


rppipirrtpipipiA PpprpTiP P"! 1 A P 1 A A P'P 1 A A f" 1 A PTPP pi 

1 CC lbL LA GG C 1 AGC 1 ACAACGA AGAC I GGC 


4585 


A OC 1 

4951 


APTTPTTPTTP P7\PP>\PP7i 

AGUGUGUG G CAGGACCA 


2259 


rppipimPipiiT»Pi pipi Pirn A f~ , t~ %r V A Pi A A PV A PUPAPAPT 1 

1GG1CC1G GGCJ.AGC1ACAALGA CACACAC1 


4586 


4956 


pit ipipipi a c^c* a p<pia pipip*pipi 
GUGGCAGG A CCAGGCCC 


2260 


pipipipi pirn pipi /-i/-i/-»rp t\ pi pirn A pi A A P'P 1 A PiPifppiPipiTv pi 

GGGCL.1GG GGCIAGC1ACAACGA CCJ.GCCAC 


4587 


4961 


TV piplTV PPTV PI P» PPPPPTiTTP 

AGGACCAG G CCCCCAUG 


2261 


/-I7\ rp p»pt pipi Pi PPPTAPPTftPTtTtPPA P1 r PP1pi'TiPlPlT1 

CATGGGGG GGCTAGC I ACAACGA CI GGTCC 1 


4588 


4967 


7\ f<rT*f~l/~*/~1r~i A T TP 1 T TP" P'P* A P* 

AGGCCCCL A UGUGGGAG 


2262 


PIT" Pi Pi Pi A pi A p« pi pirn A PPfA pi A A P'P* A pi pi pi pi pi pipirp 

CTCCCACA GGCTAGCTACAACGA GGGGGCCT 


4589 


4969 


PipiPiP»piPi A T T P 1 T1PPP A PiP'TT 

GCCCCCAU G UGGGAGCU 


2263 


71 pipirp/-i/-ipi7\ pipiP"T , 7v pipifTiA P*<A A PIP 1 A A r TT*f r ^r , /^ , r % 

AGCTLCCA GGCTAGC 1 ACAACGA A1GGGGGC 


4590 


4975 


7\ T TpifTplplp 1 7\ P 1 PITTPI A PiPlpipi 

AUGUGGGA G CUGACCCC 


2264 


/iPiPipiTipiA Pi ppiprnTi pprtiTi PTt Ti pip 71 mpipipi A Pi A T> 

GGGGTCAG GGCTAGCTACAACGA TCCCACAT 


4591 


4979 


PIPIPIA PipiTTPI A piplpipiTTTTpipi 

GGGAGCUG A CCCCUUGG 


2265 


pi Pi tv A Pi Pi Pi pi pi pi pirn tv pi pumv pi tv tv Pipi A pi A Pi Pirn pipi p» 

CCAAGGGG GGCTAGCTACAACGA CAGCTCCC 


4592 


4989 


plp1i*"1T TT T/"^P"1P1 TV TTTTP"1TTP1P»TV P» 

CCCUUGGG A UUCUGGAG 


2266 


PimplplA pi A A PIPIPITI AP1p1»T»A P1A A Pipi A P»P1pi A A Plpip* 

CTCCAGAA GGCTAGCTACAACGA CCCAAGGG 


4593 


4997 


AUUCUGGA G CUGUGCUG 


2267 


CAGCACAG GGCTAGCTACAACGA TCCAGAAT 


4594 


5000 


nrT^nTi nnti t~\ ttpiptttpitv ttpi 

CUGGAGCU G UGCUGAUG 


2268 


Pi TV m Pi TV PI PI TV nplPlfTITV P1PMTITV PITV TV P1/~»TV TV PI P»m PI Pi TV PI 

CATCAGCA GGCTAGCTACAACGA AGCTCCAG 


4595 ! 


5002 


PI PI TV Pi P»T TPT T P" P»T TP* TV I TPPP 

GGAGCUGU G CUGAUGGG 


2269 


Pi PI PI TV rn PI A PI PIPIPimTV P'PWTITV P^TV TV PtPITV TV PITV PI pimpipi 

CCCATCAG GGCTAGCTACAACGA ACAGCTCC 


4596 


5006 


CUGUGCUG A UGGGCAGG 


2270 


/"If /~1 /l/^ TV /"*trrtTv /*t /*imT\ /^TV TV TV /^l TV /"tTV ^* 

CCTGCCCA GGCTAGCTACAACGA CAGCACAG 


4597 


5010 


GCUGAUGG G CAGGGGAG 


2271 


CTCCCCTG GGCTAGCTACAACGA CCATCAGC 


4598 


5020 


t\ ymnn ni tv /~f nnTv /t/*it tv*i/t 

AGGGGAGA G CCAGCUCC 


2272 


GGAGCTGG GGCTAGCTACAACGA TCTCCCCT 


4599 


5024 


GAGAGCCA G CUCCUCCC 


2273 


GGGAGGAG GGCTAGCTACAACGA TGGCTCTC 


4600 


5044 


GAGGGAGG G UCUUGAUG 


2274 


yin m p**i ts ta /™i tv /"i/inrriA rtnmi\ nii tv /t/— 1 tv nAfn/*iyi/irn/^ 

CATCAAGA GGCTAGCTACAACGA CCTCCCTC 


4601 


5050 


GGGUCUUG A UGCCUGGG 


2275 


/rnm\ ^1 tv pinmi\ / tin ta /"vtv tv /~i^t tv v^t t\ t\ r% /'inn 

CCCAGGCA GGCTAGCTACAACGA CAAGACCC 


4602 


5052 


GUCUUGAU G CCUGGGGU 


2276 


ACCCCAGG GGCTAGCTACAACGA ATCAAGAC 


4603 


5059 


UGCCUGGG G UUACCCGC 


2277 


GCGGGTAA GGCTAGCTACAACGA CCCAGGCA 


4604 


5062 


CUGGGGUU A CCCGCAGA 


2278 


TCTGCGGG GGCTAGCTACAACGA AACCCCAG 


4605 


5066 


GGUUACCC G CAGAGGCC 


2279 


GGCCTCTG GGCTAGCTACAACGA GGGTAACC 


4606 


5072 


CCGCAGAG G CCUGGGUG 


2280 


CACCCAGG GGCTAGCTACAACGA CTCTGCGG 


4607 


5078 


AGGCCUGG G UGCCGGGA 


2281 


TCCCGGCA GGCTAGCTACAACGA CCAGGCCT 


4608 


5080 


/i /">nr Tn /■> n t T n r^on/nimv 

GCCUGGGU G CCGGGACG 


2282 


y p mm/*m/^/^i/ p r /"l/'mnr tv n /*rm tv r~~\ tv tv /in tv tv /*mn tv nn /r 

CGTCCCGG GGCTAGCTACAACGA ACCCAGGC 


4609 


5086 


GUGCCGGG A CGCUCCCC 


2283 


GGGGAGCG GGCTAGCTACAACGA CCCGGCAC 


4610 


5088 


GCCGGGAC G CUCCCCGG 


2284 


CCGGGGAG GGCTAGCTACAACGA GTCCCGGC 


4611 


5096 


GCUCCCCG G UUUGGCUG 


2285 


CAGCCAAA GGCTAGCTACAACGA CGGGGAGC 


4612 


5101 


CCGGUUUG G CUGAAAGG 


2286 


CCTTTCAG GGCTAGCTACAACGA CAAACCGG 


4613 1 


5113 


AAAGGAAA G CAGAUGUG 


2287 


CACATCTG GGCTAGCTACAACGA TTTCCTTT 


4614 


5117 


GAAAGCAG A UGUGGUCA 


2288 


TGACCACA GGCTAGCTACAACGA CTGCTTTC 


4615 


5119 


AAGCAGAU G UGGUCAGC 


2289 


GCTGACCA GGCTAGCTACAACGA ATCTGCTT 


4616 


5122 


CAGAUGUG G UCAGCUUC 


2290 


GAAGCTGA GGCTAGCTACAACGA CACATCTG 


4617 


5126 


UGUGGUCA G CUUCUCCA 


2291 


TGGAGAAG GGCTAGCTACAACGA TGACCACA 


4618 


5134 


GCUUCUCC A CUGAGCCC 


2292 


GGGCTCAG GGCTAGCTACAACGA GGAGAAGC 


4619 


5139 


UCCACUGA G CCCAUCUG 


2293 


CAGATGGG GGCTAGCTACAACGA TCAGTGGA 


4620 


5143 


CUGAGCCC A UCUGGUCU 


2294 


AGACCAGA GGCTAGCTACAACGA GGGCTCAG 


4621 


5148 


CCCAUCUG G UCUUCCCG 


2295 


CGGGAAGA GGCTAGCTACAACGA CAGATGGG 


4622 


5159 


UUCCCGGG G CUGGGCCC 


2296 


GGGCCCAG GGCTAGCTACAACGA CCCGGGAA 


4623 


5164 


GGGGCUGG G CCCCAUAG 


2297 


CTATGGGG GGCTAGCTACAACGA CCAGCCCC 


4624 


5169 


UGGGCCCC A UAGAUCUG 


2298 


CAGATCTA GGCTAGCTACAACGA GGGGCCCA 


4625 


5173 


CCCCAUAG A UCUGGGUC 


2299 


GACCCAGA GGCTAGCTACAACGA CTATGGGG 


4626 


5179 


AGAUCUGG G UCCCUGUG 


2300 


CACAGGGA GGCTAGCTACAACGA CCAGATCT 


4627 


5185 


GGGUCCCU G UGUGGCCC 


2301 


GGGCCACA GGCTAGCTACAACGA AGGGACCC 


4628 


5187 


GUCCCUGU G UGGCCCCC 


2302 


GGGGGCCA GGCTAGCTACAACGA ACAGGGAC 


4629 
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5190 


CCUGUGUG G CCCCCCUG 


2303 


CAGGGGGG GGCTAGCTACAACGA CACACAGG 


4630 


5199 


CCCCCCUG G UCUGAUGC 


2304 


GCATCAGA GGCTAGCTACAACGA CAGGGGGG 


4631 


5204 


CUGGUCUG A UGCCGAGG 


2305 


CCTCGGCA GGCTAGCTACAACGA CAGACCAG 


4632 


5206 


GGUCUGAU G CCGAGGAU 


2306 


ATCCTCGG GGCTAGCTACAACGA ATCAGACC 


4633 


5213 


UGCCGAGG A UACCCCUG 


2307 


CAGGGGTA GGCTAGCTACAACGA CCTCGGCA 


4634 


5215 


CCGAGGAU A CCCCUGCA 


2308 


TGCAGGGG GGCTAGCTACAACGA ATCCTCGG 


4635 


5221 


AUACCCCU G CAAACUGC 


2309 


GCAGTTTG GGCTAGCTACAACGA AGGGGTAT 


4636 


5225 


CCCUGCAA A CUGCCAAU 


2310 


ATTGGCAG GGCTAGCTACAACGA TTGCAGGG 


4637 


5228 


UGCAAACU G CCAAUCCC 


2311 


GGGATTGG GGCTAGCTACAACGA AGTTTGCA 


4638 


5232 


AACUGCCA A UCCCAGAG 


2312 


CTCTGGGA GGCTAGCTACAACGA TGGCAGTT 


4639 


5242 


CCCAGAGG A CAAGACUG 


2313 


CAGTCTTG GGCTAGCTACAACGA CCTCTGGG 


4640 


5247 


AGGACAAG A CUGGGAAG 


2314 


CTTCCCAG GGCTAGCTACAACGA CTTGTCCT 


4641 


5255 


ACUGGGAA G UCCCUGCA 


2315 


TGCAGGGA GGCTAGCTACAACGA TTCCCAGT 


4642 


5261 


AAGUCCCU G CAGGGAGA 


2316 


TCTCCCTG GGCTAGCTACAACGA AGGGACTT 


4643 


5270 


CAGGGAGA G CCCAUCCC 


2317 


GGGATGGG GGCTAGCTACAACGA TCTCCCTG 


4644 


5274 


GAGAGCCC A UCCCCGCA 


2318 


TGCGGGGA GGCTAGCTACAACGA GGGCTCTC 


4645 


5280 


CCAUCCCC G CACCCUGA 


2319 


TCAGGGTG GGCTAGCTACAACGA GGGGATGG 


4646 


5282 


AUCCCCGC A CCCUGACC 


2320 


GGTCAGGG GGCTAGCTACAACGA GCGGGGAT 


4647 


5288 


GCACCCUG A CCCACAAG 


2321 


CTTGTGGG GGCTAGCTACAACGA CAGGGTGC 


4648 


5292 


CCUGACCC A CAAGAGGG 


2322 


CCCTCTTG GGCTAGCTACAACGA GGGTCAGG 


4649 


5301 


CAAGAGGG A CUCCUGCU 


2323 


AGCAGGAG GGCTAGCTACAACGA CCCTCTTG 


4650 


5307 


GGACUCCU G CUGCCCAC 


2324 


GTGGGCAG GGCTAGCTACAACGA AGGAGTCC 


4651 


5310 


CUCCUGCU G CCCACCAG 


2325 


CTGGTGGG GGCTAGCTACAACGA AGCAGGAG 


4652 


5314 


UGCUGCCC A CCAGGCAU 


2326 


ATGCCTGG GGCTAGCTACAACGA GGGCAGCA 


4653 


5319 


CCCACCAG G CAUCCCUC 


2327 


GAGGGATG GGCTAGCTACAACGA CTGGTGGG 


4654 


5321 


CACCAGGC A UCCCUCCA 


2328 


TGGAGGGA GGCTAGCTACAACGA GCCTGGTG 


4655 



Input Sequence = HUMRa sH_mRNA . Cut Site « R/Y 
Arm Length = 8 . Core Sequence = GGCTAGCTACAACGA 

HUMRa sH_mRNA (Human c-Ha-rasl proto- oncogene, spliced mRNA sequence; 5336 nt) 
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Table IV: Human HER2 DNAzyme and Substrate Sequence 



Pos 


Substrate 


Seq 
ID 


DNAzyme 


Seq 
ID 


9 


AAGGGGAG G UAACCCUG 


4656 


CAGGGTTA GGCTAGCTACAACGA CTCCCCTT 


5644 


12 


GGGAGGUA A CCCUGGCC 


4657 


GGCCAGGG GGCTAGCTACAACGA TACCTCCC 


5645 


18 


UAACCCUG G CCCCUUUG 


4658 


CAAAGGGG GGCTAGCTACAACGA CAGGGTTA 


5646 


27 


CCCCUUUG G UCGGGGCC 


4659 


GGCCCCGA GGCTAGCTACAACGA CAAAGGGG 


5647 


33 


UGGUCGGG G CCCCGGGC 


4660 


GCCCGGGG GGCTAGCTACAACGA CCCGACCA 


5648 


40 


GGCCCCGG G CAGCCGCG 


4661 


CGCGGCTG GGCTAGCTACAACGA CCGGGGCC 


5649 


43 


CCCGGGCA G CCGCGCGC 


4662 


GCGCGCGG GGCTAGCTACAACGA TGCCCGGG 


5650 


46 


GGGCAGCC G CGCGCCCC 


4663 


GGGGCGCG GGCTAGCTACAACGA GGCTGCCC 


5651 


48 


GCAGCCGC G CGCCCCUU 


4664 


AAGGGGCG GGCTAGCTACAACGA GCGGCTGC 


5652 


50 


AGCCGCGC G CCCCUUCC 


4665 


GGAAGGGG GGCTAGCTACAACGA GCGCGGCT 


5653 


60 


CCCUUCCC A CGGGGCCC 


4666 


GGGCCCCG GGCTAGCTACAACGA GGGAAGGG 


5654 


65 


CCCACGGG G CCCUUUAC 


4667 


GTAAAGGG GGCTAGCTACAACGA CCCGTGGG 


5655 


72 


GGCCCUUU A CUGCGCCG 


4668 


CGGCGCAG GGCTAGCTACAACGA AAAGGGCC 


5656 


75 


CCUUUACU G CGCCGCGC 


4669 


GCGCGGCG GGCTAGCTACAACGA AGTAAAGG 


5657 


77 


UUUACUGC G CCGCGCGC 


4670 


GCGCGCGG GGCTAGCTACAACGA GCAGTAAA 


5658 


80 


ACUGCGCC G CGCGCCCG 


4671 


CGGGCGCG GGCTAGCTACAACGA GGCGCAGT 


5659 


82 


UGCGCCGC G CGCCCGGC 


4672 


GCCGGGCG GGCTAGCTACAACGA GCGGCGCA 


5660 


84 


CGCCGCGC G CCCGGCCC 


4673 


GGGCCGGG GGCTAGCTACAACGA GCGCGGCG 


5661 


89 


CGCGCCCG G CCCCCACC 


4674 


GGTGGGGG GGCTAGCTACAACGA CGGGCGCG 


5662 


95 


CGGCCCCC A CCCCUCGC 


4675 


GCGAGGGG GGCTAGCTACAACGA GGGGGCCG 


5663 


102 


CACCCCUC G CAGCACCC 


4676 


GGGTGCTG GGCTAGCTACAACGA GAGGGGTG 


5664 


105 


CCCUCGCA G CACCCCGC 


4677 


GCGGGGTG GGCTAGCTACAACGA TGCGAGGG 


5665 


107 


CUCGCAGC A CCCCGCGC 


4678 


GCGCGGGG GGCTAGCTACAACGA GCTGCGAG 


5666 


112 


AGCACCCC G CGCCCCGC 


4679 


GCGGGGCG GGCTAGCTACAACGA GGGGTGCT 


5667 


114 


CACCCCGC G CCCCGCGC 


4680 


GCGCGGGG GGCTAGCTACAACGA GCGGGGTG 


5668 


119 


CGCGCCCC G CGCCCUCC 


4681 


GGAGGGCG GGCTAGCTACAACGA GGGGCGCG 


5669 


121 


CGCCCCGC G CCCUCCCA 


4682 


TGGGAGGG GGCTAGCTACAACGA GCGGGGCG 


5670 


130 


CCCUCCCA G CCGGGUCC 


4683 


GGACCCGG GGCTAGCTACAACGA TGGGAGGG 


5671 


135 


CCAGCCGG G UCCAGCCG 


4684 


CGGCTGGA GGCTAGCTACAACGA CCGGCTGG 


5672 


140 


CGGGUCCA G CCGGAGCC 


4685 


GGCTCCGG GGCTAGCTACAACGA TGGACCCG 


5673 


146 


CAGCCGGA G CCAUGGGG 


4686 


CCCCATGG GGCTAGCTACAACGA TCCGGCTG 


5674 


149 


CCGGAGCC A UGGGGCCG 


4687 


CGGCCCCA GGCTAGCTACAACGA GGCTCCGG 


5675 


154 


GCCAUGGG G CCGGAGCC 


4688 


GGCTCCGG GGCTAGCTACAACGA CCCATGGC 


5676 


160 


GGGCCGGA G CCGCAGUG 


4689 


CACTGCGG GGCTAGCTACAACGA TCCGGCCC 


5677 


163 


CCGGAGCC G CAGUGAGC 


4690 


GCTCACTG GGCTAGCTACAACGA GGCTCCGG 


5678 


166 


GAGCCGCA G UGAGCACC 


4691 


GGTGCTCA GGCTAGCTACAACGA TGCGGCTC 


5679 


170 


CGCAGUGA G CACCAUGG 


4692 


CCATGGTG GGCTAGCTACAACGA TCACTGCG 


5680 


172 


CAGUGAGC A CCAUGGAG 


4693 


CTCCATGG GGCTAGCTACAACGA GCTCACTG 


5681 


175 


UGAGCACC A UGGAGCUG 


4694 


CAGCTCCA GGCTAGCTACAACGA GGTGCTCA 


5682 


180 


ACCAUGGA G CUGGCGGC 


4695 


GCCGCCAG GGCTAGCTACAACGA TCCATGGT 


5683 


184 


UGGAGCUG G CGGCCUUG 


4696 


CAAGGCCG GGCTAGCTACAACGA CAGCTCCA 


5684 


187 


AGCUGGCG G CCUUGUGC . 


4697 


GCACAAGG GGCTAGCTACAACGA CGCCAGCT 


5685 . 


192 


GCGGCCUU G UGCCGCUG 


469B 


CAGCGGCA GGCTAGCTACAACGA AAGGCCGC 


5686 


194 


GGCCUUGU G CCGCUGGG 


4699 


CCCAGCGG GGCTAGCTACAACGA ACAAGGCC x 


5687 


197 


CUUGUGCC G CUGGGGGC 


4700 


GCCCCCAG GGCTAGCTACAACGA GGCACAAG 


5688 


204 


CGCUGGGG G CUCCUCCU 


4701 


AGGAGGAG GGCTAGCTACAACGA CCCCAGCG 


5689 


214 


UCCUCCUC G CCCUCUUG 


4702 


CAAGAGGG GGCTAGCTACAACGA GAGGAGGA 


5690 
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zz z 


ppppttptttt p r , r , r i r'nr , r*r* 


4703 


oooooooo ppp^tappt a oa a oo a a T^r^Ar^cnr* 
LLbbbbbb bbLXAbbXALAALbA AAbAbbbb 


5691 


9^9 

Z -5 Z 


LLLLCbu/i L» LLLiLLrAlaL 


4704 


PPTPPPPP PPPTSPPT a P3i R pp a TPPPPPPP 
LtL J. LoLIjLi vaULlAuLlALAftLvj/V i LLovjLrVjva 


5692 


9** t; 


LLGGALiLL Li LGAvjLALL 


4705 


<-i/~irp/i/-tppoo PPPTiAppTApA APPA OOOrp/-iOOO 

GG1GL1LG GGLlAvaLlALAALGA GGL1LLGG 


5693 


9*5 Q 


AvjLL.LfL.V3ri L» LALLLAAo 


4706 


PTTPPPTP PPPTAPPTAPA A OO A T OP 1 OO O r"V 
LJ. lV3VJV3lLr LtvjL 1 AvaLl/iLMHLLirt ILLtLvsLtLI 


5694 


941 


LLvjvAjAvsV- A LLLAAvjUvs 


4707 


pa pttppp pppTJippThPaappa never* PPP 
LA.L x X oLivj boLl Auv. lALftALbA GLILLtLLtVj 


5695 




ccxicncL.Ti. p ttpttpptapp 

bUAL.LL/\n vj UVjULjL~AV-*w. 


4708 


PPTPPBPA PPPTTlPPTTiPR AppA rr-imoOOTTT 1 
bulbLALA vav3V^l/\V5LlALAf\Lvj/\ 1 ibublbL 


5696 


OA Q 


b.ppp7\7aptt n TTPPTAPPnn! 

AV^V^^AAvjU V3 UVJV-AV-L.VJVJ 


4709 


pppptppta ppPTfippTapaappa apttpppt 

v_L-.V3V3lV3V-/i vjov- l/\vaL li^Lxirt.L.V3/\ ilLllvavavjX 


5697 


251 


fpA7\nnnTT cx ptapppppix 


4710 


TPPPPPTP PPP r T 7 iP.PT7AP7iD.Pn!ri 7AP2VPTTPP. 
IvjV-V-VJVJjI \J VjV3V-lr%V3V- JL /\V-Xir\V-.V5A jH.V»/iL 1 1 V3VJ 


5698 


OCT 


aaPTTPnnp a ppnnp&pa. 

AAV3UV3UV3V-* A v~1«V3Vjv~AV~A 


4711 


tptppppp nnm'nppTapariPPZi ppapapTT 

1 VJ i. LjV-V—VjVj oVJV-lriV3V- IxiLrwlLvjiA 


5699 


9 R7 
Z3 / 


PTTnpappp, cz raranara 

uUuUttLLu Kj v~Av~ALrAl~A 


4712 


Tf3TPTf3Tf3 PnP r PBP.PTTiP7iriPr2TA PnCTP.P7AP 
J.Lrl Llulu LrVrL- 1 /ILtv^ 1 /iV-/i/iUVJ/-\ L.VJVJj.vyL>lL 


5700 


OC Q 


p,paprnp,p n papapaTTn 

V3V_AI_V^VjVjL, A V-AV3rAl~AUVJ 


4713 


r i n r vr ,T T i r"vr' pppt'aop'paoa aopb nnnncver* 
LAlvjXLIvj ubL J. AvaL 1 ALAHLvjA vsLLGGlGL 


5701 




(Tir* Pipar a P7\Ttpaapp 

LvjVjLALAvj A LAUVaAAVjV^ 


4714 


OTTO A T'P PPPfAPPTAPA A OO A r* T T , r*'T'{~*C % C i (~ i 

L/LllLAlva VjLiLIAGL 1ALAALGA LJ.G1GLLG 


5702 


zoo 


rr'TiHTi.ri'&r* a ttptatappttp 

vjv_AL»AVjAv^ A UL»AAljl_Uva 


4715 


PAPPTTPA PPPTAP OTA OA A OO A PTPTPT'PP 

UAGL11LA vjvjLIAGLIALAALGA G1L1G1GL 


5703 


z / u 


P7iPaTTP2iZi P. PTTPPPfJPTT 
v_j A v_ A U urtn Va v~UvjL.VJVjl-.lJ 


4716 


APPPPPAP p p (^irp APPTAPA A PP A TTPATPTP 

ALfLLvjLAva LtLtLI ALjLI ALAALvaA XXLAXG1L 


5704 


Z / J 


A FTP A APPTT /-» r>pr>PTTPPP 
AUvjAAVjv-U Vj LvjVjLULU— 


4717 


POP 7\ f^CCC PPPTAPPTAPB APPB APPTTPAT 1 

GGGAGLLG ovaLXAGLXALAALGA AGL1ICAI 


5705 


99C 
Z / 0 


nappiTPPP P"* pttpppttpp 
AAvjLUvjLvj vj LULLLULtL 


4718 


r*r % T^r*r*r*'t^r % ppptapp^a oa a oo a oo pap ott 
GL-AGGGAL» gglxaglxalaalga lgcaglxx 


5706 


O Q "5 
z O J 


PPPTTPPPTT P PPaPTTPPP 
VjVjLULLLU Vj LLAvjULLL 


4719 


PPPAPTPP PPPTAPPTAPAAPPA aoooaooo 

GGGAL X vjG GGL X AGL X ALAALGA AGGGAGLL 


5707 


zo / 


it ti^o O A o ttpppp a o a 

CCCUGlCA g ucccgaga 


4720 


rrn""u"noooo tv ooorrtivoomA otv t\ ootv m»~»/*i/^T» r*t~*t~* 

TCTCGGGA GGCTAGCTACAACGA TGGCAGGG 


5708 


295 


OT A O 7\ 0007V P1/~1TT/~1 

GUCCCGAG a cccaccug 


4721 


CAGGTGGG GGCTAGCTACAACGA CTCGGGAC 


5709 


299 


r^r^t TV /"l 7\ f~Sf~\ O TV /-TT TO O TV O 7V 

CGAGALLL A CCUGGACA 


4722 


TGTCCAGG GGCTAGCTACAACGA GGGTCTCG 


5710 


305 


PPRPrtTPP 7V P7kTTPPITPP 

LCALLUGG A LAUbLULL 


4723 


GGAGCATG GGCTAGCTACAACGA CCAGGTGG 


5711 


3 07 


ACCUGGAC A UGCUCCGC 


4724 


GCGGAGCA GGCTAGCTACAACGA GTCCAGGT 


5712 


309 


CUGGACAU G CUCCGCCA 


4725 


TGGCGGAG GGCTAGCTACAACGA ATGTCCAG 


5713 


314 


CAUGCUCC G CCACCUCU 


4726 


AGAGGTGG GGCTAGCTACAACGA GGAGCATG 


5714 


317 


GCUCCGCC A CCUCUACC 


4727 


GGTAGAGG GGCTAGCTACAACGA GGCGGAGC 


5715 


323 


CCACCUCU A CCAGGGCU 


4728 


AGCCCTGG GGCTAGCTACAACGA AGAGGTGG 


5716 


•a o q 


LUALCAGG G LUGLLAGG 


4729 


CCTGGCAG GGCTAGCTACAACGA CCTGGTAG 


5717 


•an 
ojz 


C'f'Tir'fPPTT O PPRPPTTPP 

LUAbvjoLU G LLAbbUbo 


4730 


OOA OOT'OO PPPTAPPTAPAAPPA 7\ O OOOT>00 

CCACCTGG GGCTAGCTACAACGA AGCCCTGG 


5718 




oottoooao o itpptippap 
GLUGLLAG G UGGUGLAG 


4731 


CTGCACCA GGCTAGCTACAACGA CTGGCAGC 


5719 


1 A t\ 


OOOAOOITO O nPPTlPPPA 

GLLAGGUG G UGLAGGGA 


4732 


rp/^/^70 rfiA~i TV 0 pjni tv /trrt TA TV TV ^t/~i 7V ^tv /^/^m/~t/™»/^ 

TCCCTGCA GGCTAGCTACAACGA CACCTGGC 


5720 


n a o 
J4z 


PAPPTTPPTT O O TV OOP 1 AAA 

LAGGUGGU G LAGGGAAA 


4733 


TTTCCCTG GGCTAGCTACAACGA ACCACCTG 


5721 


OCA 
■33U 


OOAOOOAA A OOTTOOAAO 

GLAGGGAA A LLUGGAAL 


4734 


GTTCCAGG GGCTAGCTACAACGA TTCCCTGC 


5722 


"3 C "7 


AAOOTTOOA A OTTOAOOITA 

AACCUGGA A CUCACCUA 


4735 


m Tv /"irT^fi Tv /^i s**m t% /^m tv t\ tv /*7/^t t\ ptt/*i^ttv ^t/*i#twtt. 

TAGGTGAG GGCTAGCTACAACGA TCCAGGTT 


5723 


361 


TTOOAAOTTO A OOTTAOOTTO 

UGGAALUL A LLUALLUG 


4736 


CAGGTAGG GGCTAGCTACAACGA GAGTTCCA 


5724 


365 


A OT TO A O OT T A OOTTOOOOA 

ALULALLU A LLUGLLLA 


4737 


TGGGCAGG GGCTAGCTACAACGA AGGTGAGT 


5725 


369 


AOOTTAOOTT O OOOAOOTVA 

ALLUALLU G CCCACLAA 


4738 


nvi>P<PirnPi/^/i /*i/^/^irfriv /^/*irrvT\ /^» tv tv n/iiv tv ri/im'jx pom 

TTGGTGGG GGCTAGCTACAACGA AGGTAGGT 


5726 


373 


ACCUGCCC A CCAAUGCC 


4739 


GGCATTGG GGCTAGCTACAACGA GGGCAGGT 


5727 


377 


GCCCACCA A UGCCAGCC 


4740 


GGCTGGCA GGCTAGCTACAACGA TGGTGGGC 


5728 


379 


CCACCAAU G CCAGCCUG 


4741 


CAGGCTGG GGCTAGCTACAACGA ATTGGTGG 


5729 


383 


CAAUGCCA G CCUGUCCU 


4742 


AGGACAGG GGCTAGCTACAACGA TGGCATTG 


5730 


387 


GCCAGCCU G UCCUUCCU 


4743 


AGGAAGGA GGCTAGCTACAACGA AGGCTGGC 


5731 


396 


UCCUUCCU G CAGGAUAU 


4744 


ATATCCTG GGCTAGCTACAACGA AGGAAGGA 


5732 


401 


CCUGCAGG A UAUCCAGG 


4745 


CCTGGATA GGCTAGCTACAACGA CCTGCAGG 


5733 


403 


UGCAGGAU A UCCAGGAG 


4746 


CTCCTGGA GGCTAGCTACAACGA ATCCTGCA 


5734 


412 


UCCAGGAG G UGCAGGGC 


4747 


GCCCTGCA GGCTAGCTACAACGA CTCCTGGA 


5735 


414 


CAGGAGGU G CAGGGCUA 


4748 


TAGCCCTG GGCTAGCTACAACGA ACCTCCTG 


5736 


419 


GGUGCAGG G CUACGUGC 


4749 


GCACGTAG GGCTAGCTACAACGA CCTGCACC 


5737 


422 


GCAGGGCU A CGUGCUCA 


4750 


TGAGCACG GGCTAGCTACAACGA AGCCCTGC 


5738 


424 


AGGGCUAC G UGCUCAUC 


4751 


GATGAGCA GGCTAGCTACAACGA GTAGCCCT 


5739 


426 


GGCUACGU G CUCAUCGC 


4752 


GCGATGAG GGCTAGCTACAACGA ACGTAGCC 


5740 


430 


ACGUGCUC A UCGCUCAC 


4 753 


GTGAGCGA GGCTAGCTACAACGA GAGCACGT 


5741 


433 


UGCUCAUC G CUCACAAC 


4754 


GTTGTGAG GGCTAGCTACAACGA GATGAGCA 


5742 
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437 


CAUCGCUC A CAACCAAG 


4755 


CTTGGTTG GGCTAGCTAPAAPGA GAGCGATG 


5743 


440 


CGCUCACA A CCAAGUGA 


4756 


TCACTTGG GGCTAGCTACAACGA TGTGAGCG 


5744 


445 


ACAACCAA G UGAGGCAG 


A1 EZH 


CTGCCTCA GGCTAGCTACAACGA TTGGTTGT 


5745 


450 


CAAGUGAG G CAGGUCCC 


4758 


GGGACCTG GGCTAGCTACAACGA CTCACTTG 


5746 


454 


UGAGGCAG G UCCCACUG 


4759 


CAGTGGGA GGCTAGCTACAACGA CTGCCTCA 


5747 


459 


CAGGUCCC A CUGCAGAG 


a *7 tin. 
4 /60 


CTCTGCAG GGCTAGCTACAACGA GGGACCTG 


5748 


462 


GUCCCACU G CAGAGGCU 


4 /ol 


AGCCTCTG GGCTAGCTACAACGA AGTGGGAC 


5749 


468 


CUGCAGAG G CUGCGGAU 


4762 


ATCCGCAG GGCTAGCTACAACGA CTCTGCAG 


5750 


471 


CAGAGGCU G CGGAUUGU 


4763 


ACAATCCG GGCTAGCTACAACGA AGCCTCTG 


5751 


475 


GGCUGCGG A UUGUGCGA 


A 1 C. A. 

4 / b4 


TCGCACAA GGCTAGCTACAACGA CCGCAGCC 


5752 


478 


UGCGGAUU G UGCGAGGC 


4765 


GCCTCGCA GGCTAGCTACAACGA AATCCGPA 


5753 


480 


CGGAUUGU G CGAGGCAC 


4766 


GTGCCTCG GGCTAGCTACAACGA ACAATCCG 


5754 


485 


UGUGCGAG G CACCCAGC 


4767 


GCTGGGTG GGCTAGCTACAACGA PTPGPAPA 

w w i www -L wf v_7\_> w _l rvuvt x ri^/vr^vj/^ w»i www/A w** 


5755 


487 


UGCGAGGC A CCCAGCUC 


4768 


GAGCTGGG GGCTAGCTACAACGA GCCTCGCA 


5756 


492 


GGCACCCA G CUCUUUGA 


4769 


TCAAAGAG GGCTAGCTACAACGA TGGGTGCC 


5757 


503 


CUUUGAGG A CAACUAUG 


4770 


CATAGTTG GGCTAGCTACAACGA CCTCAAAG 


5758 


506 


UGAGGACA A CUAUGCCC 


4771 


GGGPATAG GGPTAGPTAPAAPGA TGTPPTPA 

vjvjvjurt. vjvju ArtV3U 1 n^onv<vjrv 1 <J i V«\> X V^n 


5759 


509 


GGACAACU A UGCCCUGG 


4772 


CCAGGGCA GGPTAGPTAPAAPGA AGTTGTPP 
v.vn\jvuvn vjuv-^nvv.irtwviv.urt rt vj X J. vj J. u u 


5760 


511 


ACAACUAU G CCCUGGCC 


4773 


GGCCAGGG GGPTAGPTAPAAPGA ATAP.TTPT 

VJv3UUrtVJv3V7 VJVJU XrtVJU XrtUrtrtUVjrt AlnU X J. V3 X 


5761 


517 


AUGCCCUG G CCGUGCUA 


4774 


tagpapgg ggptagptapa arna rann^raT 

X rtw v-rt^UU VjVjj U X nuL i. rtUrtrtUVjrt V^rtvjvJ VJ v~-rt X 


5762 


520 


CCCUGGCC G UGCUAGAC 


4775 


GTCTAGPA GGPTAGPTAPAAPGA PPPPAGPn 


5763 


522 


CUGGCCGU G CUAGACAA 


4776 


TTGTCTAG GGCTAGCTACAACGA A CORP PAG 


5764 


527 


CGUGCUAG A CAAUGGAG 


4777 


CTCCATTG GGCTAGCTACAACGA PTAGCAPG 

W A W W-^J x 4. w 1 ww'W X/lvv J./*wlAw\3A w X/^wwx>ww^ 


5765 


530 


GCUAGACA A UGGAGACC 


4778 


GGTCTCCA GGCTAGCTACAACGA TGTPTAGC 

wwi*»w*»wwx> www a-Tiww j. riwv* ww XwxwXx^Njrw 


5766 


536 


CAAUGGAG A CCCGCUGA 


4779 


i TCAGCGGG GGCTAGCTACAACGA PTCPATTG 

A W**W^WWAJw W\JW J» xlUL X ACrl/lVHiVjri w v\#Al X W 


5767 


540 


GGAGACCC G CUGAACAA 


4 78 0 


TTGTTCAG GGCTAGCTACAACGA GGGTCTCC 


5768 


545 


CCCGCUGA A CAAUACCA 


4781 


TGGTATTG GGPTAGPTAPAAPGA TPAGPGGG 
xvjvriiii i\j vjuv a nvi. i nunnwvsn x urtvjwjvjvj 


5769 


548 


GCUGAACA A UACCACCC 


4782 


GGGTGGTA GGCTAGPTAPAAPGA TfiTTPAGP 

uuuiuuxrt uvjv. i nw v< x nv^inv<vin xui xurtuu 


5770 


550 


UGAACAAU A CCACCCCU 


4783 


AGGGGTGG GGPTAGPTAPAAPGA ATTP.TTPA 


5771 


553 


ACAAUACC A CCCCUGUC 


4784 


GACAGGGG GGPTAGPTAPAAPGA GGTATTGT 

vjrturt vjvj vjvj uooirtuv.invJviwurt Uulftj. lul 


5772 


559 


CCACCCCU G UCACAGGG 


4785 


PPCTGTGA GGPTAGPTAPAAPGA AGGGGTGf! 
UV.V.1U1 vjrt vjvju XrtVJv. X rtUrlri v- Vjrt Avjvjuo 1 VjIj 


5773 


562 


CCCCUGUC A CAGGGGCC 


4786 


GGPPPPTG GGPTAGPTAPA A PP A Pafl^PPr 

OULLUV. X VJ SJOV.XrtlJVjXrtW~rtrtV.ljrt vjAuAv3V3v3VJ 


5774 


568 


UCACAGGG G CCUCCCCA 


4787 


TGGGGAGG GGPTAGPTAPA APP. A rfTTPTPa 

X VJVJVJVjrtVJVJ V3v7UXrtV7V.XrtV.rtrtV.V3rt LL-LlljluA 


5775 


581 


CCCAGGAG G CCUGCGGG 


4788 


CPCGPAGG GGPTAGPTAPAAPGA PTPPTP.P.r2 
wuuuunuu vjvju XrtVJU XrtUrtrtUVjrt v. X uu 1 uuu 


5776 


585 


GGAGGCCU G CGGGAGCU 


4789 


AGCTPCCG GGPTAGPTAPAAPGA ArcrJPPTPP 

rtVJU X UUUV7 VJVJU J. rtVJU X rtUrtrtUVJrt ftuuV.V^lV.V> 


5777 


591 


CUGCGGGA G CUGCAGCU 


4790 


AGCTGPAG GGPTAGPTAPAAPGA TPPPPPA^i 

rtUUlUUftU VJVJU XrtVJU XrtUrtrtUVjrt X UV^UVJV*Av3 


5778 


594 


CGGGAGCU G CAGPUUPG 


4791 


u vJrtrtvyv. X Kj uuL X AvJ v. 1 A V-AfHuVjA A vj L. 1 U V. v-VJ 


5779 


597 


gagpttgpa g pttitpgaag 


4792 


V.J. iLUrtftU VJVJU X rtv3V. J. AuAAuvjA lv7UrtVJV.Xv. 


5780 


605 


GPUIIPGAA G PPT TP A PAG 
ov.uuv.unn vj u v- u UrtUrtVj 


4793 


V.XV3i.V3rtV3V3 V3V3V.lrtVJV.lAv.rtrtV.V3 A IIUVjAAvjC 


5781 


610 


GAAGCCUC A PAGAGAOP 

urUl\JV„VUV, rt. UrtV3rtV7rt V/U 


4794 


v3rtlv.lL.lv3 V3V3V.1 Av3v. 1 Av^AACvjA VjAv3v3L11v. 


5782 


616 


UCACAGAG A UCUUGAAA 


4795 


T"T"TPA ZVnZi P.P.PT7Af2Pf APA RnPA PTPfP^P 7\ 
XXXUrtrtvjrt v3v3v.XrtvjulAUrtAv.V3A v.lulvjlvjA 


5783 


631 


AAGGAGGfj n t tpttttp a nn 

rtMvjrVjrtVjVovj vj Uv.UUv7AUv«. 


4796 


CaA\l LAAGA GGCTTAGCTACAACGA CCCTCCTT 


5784 


637 


GGGUCUUG A UCCAGCGG 


4797 


CCGCTGGA GGCTAGCTACAACGA PAAGAPPP 

WWWWJ-WW^*. WW W 4-**W W IrtuXlrtV- \JT\ wiTivJrtV^. W w 


5785 


642 


UUGAUCCA G CGGAACCC 


4798 


GGGTTCCG GGCTAGCTACAACGA TGGATCAA 


5786 


647 


CCAGCGGA A CCCCCAGC 


4799 


GCTGGGGG GGCTAGCTACAACGA TCCGCTGG 


5787 


654 


AACCCCCA G CUCUGCUA 


4800 


TAGCAGAG GGCTAGCTACAACGA TGGGGGTT 


5788 


659 


CCAGCUCU G CUACCAGG 


4801 


CCTGGTAG GGCTAGCTACAACGA AGAGCTGG 


5789 


662 


GCUCUGCU A CCAGGACA 


4802 


TGTCCTGG GGCTAGCTACAACGA AGCAGAGC 


5790 


668 


CUACCAGG A CACGAUUU 


4803 


AAATCGTG GGCTAGCTACAACGA CCTGGTAG 


5791 


670 


ACCAGGAC A CGAUUUUG 


4804 


CAAAATCG GGCTAGCTACAACGA GTCCTGGT 


5792 


673 


AGGACACG A UUUUGUGG 


4805 


CCACAAAA GGCTAGCTACAACGA CGTGTCCT 


5793 


678 


ACGAUUUU G UGGAAGGA 


4806 


TCCTTCCA GGCTAGCTACAACGA AAAATCGT 


5794 
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686 


OTTnn 7V TV P*PI TV /"TTV T TP»TTTTP>PI 

GUGGAAGG A CAUCUUCC 


4807 


P'P'AAP'AT'P 1 PrPTAf PT" 7\ P 1 7\ APPA PPTTPPliP 

GGAAIjA 1 G vjvj»V- 1 AGC J. AC AACG A U L. 1 i LUAL 


5795 


boo 


rT* TV A P 1 P' A P 1 A TTPTTi'IPP'AP 

UvjAAvjGAC A Ul-UUCvJAt, 


4808 


ptppa ap a ppptzippt a pa a pp. a ptppttpp 

Vjr X uvririibA o\jv_ 1 /iljL. X Av,AAL vj>\ Vj JL V^l^ X X V-V- 


5796 




PailPlllTPP 7A Pa 7Af27A aPa 


4809 


t^ttptth nnPTAPPTapaappa ppaanaTP 


5797 




pfapaapa a paappapp 

LUrivJiAuii A v~AAU-v_HVjv- 


4810 


pptppttp ppPTJXPPTapaappa tpttptpp 


5798 


1 CiA 


pa ana a pa a ppanPTT/2f2 

LnnurtAUi /i \^v_J-iv_7L.Uvjvj 


4811 


nr t 'h( r zr tr rcin. pp.PTapPTapaappa tp.ttpttp 

\- L~/-iVjv^ X ovj vwt X iaVjV- X >lV^iLrt.L.OM. IbllLllb 


5799 


T Pi Q 


a a Pa a ppa p pt tpppttptt 
AAL_ttAv^CA Vj L.UG\jV-UCU 


4812 


apapppap pppTapPTapaappa tppttptt 

AvjAov-LIAvj vjvjUX/\vjUX Av»J\i^Uvj/\ luul Ivjl 1 


5800 


/ ±Z 


APPAfPTTP r» PTTPTTPAPa 


4813 


TPTPapap ppPTapPTapa appa r*"hnH r rf* , r* , v 
ibl bnunb Vavjjl_ X >ibj 1 Av.AAv.urA. L^n-Vj u X X 


5801 


/ -Lb 


ttpppttptip a paPTtPaTia 

UIjvjL-UUUL. A LACUvjAUA 


4814 


tatpaptp pppTapPTapaappa napapppa 

X/1X v_/\bj X vj vj\jv^ X AVav- X ALAAv,uA vji\vj/\vjv».v— i\ 


5802 




ppttpttpap a PTipaiTapa 


4815 


TPTaTpan pppTapPTapaappa p.tp a p a p p 


5803 


*7 0 A 


npapapup a napapapp 


4816 


ppTPTPTa pppTapPTapaappa paPTPTPa 

ovaXvjXCXA vj\JV-X Autv-X ALAALbA (JAvjtXGXAjA 


5804 


TOO 
O 


apTTpaiTan a fappa app 
ACUGAUAo A vJALLftAtv., 


4817 


ppttpptp ppprapprapaappa ptbtpipt 

vjVj X X vjljr 1 vj VjtjUXAVjUXACAAL.ijA LlAlUAbl 


5805 


/ JU 


irparrar'ap a r , f~ i T^'Ar*r , r , n 
UloAUAGAt- A LLAAtLoL 


4818 


r , r^nn r T xr vr , H pppTapPTapaappa ptptatpb 
ljUvi\aX Xvjvj vjvjL.XAVjL.1 ACAACGA UXCXAILA 


5806 


"7 "3 /I 


SP21PRPP7\ a PPPPITPTTP 
AuAtALLA A CCvaUUCUC 


4819 


papapp'p^p* p*p , p ,r rapP r PA p»a a pv**a r rpp ,r PP ,r PP w P 
GAGAlaCCtvj GGC X AGC I ACAACGA XGGXGXCX 


5807 


Til 


CAUV-AACC \3 CUCUCGGG 


4820 


PPPPAPTlP PPPTTkPPTinPAAPPA r , P , rnrpr<P"TTr' 

CCHjAGAG GGCX AG CI ACAACGA GGilGGxG 


5808 


745 


GCUCUCGG G CCUGCCAC 


4821 


PTPPPAPP /^P'P'T'tv /-I pirn t\ /-tTv TV PV TV P»plp»n P< A P^P* 

GTGGCAGG GGCTAGCTACAACGA CCGAGAGC 


5809 




nppppppii p rra c i r*r y r > tt 
UCGGGCCU G CtALtLLU 


4822 


7\ /™'/"'P" , P"7V"i/~t PPPT7jPPT 1 71P71?VPPR JlPPPPPP)\ 

AGGGtjji<j(j GGCx AGC X ACAACGA AGGCCCGA 


5810 


752 


ppnr'TTr'rip a p , p , p , p*ttp , tttt 
GGCCUGCC A CCCCUGUU 


4823 


A A P>A Ptptptpt P»P*P"T'AP^P«nrt7V P« A TV PV"» A P»P"nTl P»P»PTP» 

AACAGGGG GGCxAGCTACAACGA GGCAGGCC 


5811 


758 


CCACCCCU G UUCUCCGA 


4824 


TCGGAGAA GGCTAGCTACAACGA AGGGGTGG 


5812 


766 


/"•TTTTP1TT/-1/-V1 TV T T/TT TP1T T 7k TV /"I 

GUUCUCCG A UGUGUAAG 


4825 


CTTACACA GGCTAGCTACAACGA CGGAGAAC 


5813 


768 


TiPiTrrimMT /~» yt/"itttv tv fi/ir* 

UCUCCGAU G UGUAAGGG 


4826 


CCCTTACA GGCTAGCTACAACGA ATCGGAGA 


5814 


770 


UCCGAUGU g uaagggcu 


4827 


AGCCCTTA GGCTAGCTACAACGA ACATCGGA 


5815 


776 


GUGUAAGG g cucccgcu 


4828 


tv Ptpipipt/^i tv /"» <^P»PtrnTv pipmiv /^tv tv ^»/-itv PPmrn* /in n 

AGCGGGAG GGCTAGCTACAACGA CCTTACAC 


5816 


7 82 


nnnPTTPHP p» /^ , ttpt i tt/*^p'p» 
GGGCUCCC G CUGCUGGG 


4829 


P'P'P'Tv P« (^7\ i™» nrr tv /—< Ptrn tv Ti tv /t^t tv r*f*t~\ t\ 

CCCAGCAG GGCTAGCTACAACGA GGGAGCCC 


5817 


785 


CUCCCGCU G CUGGGGAG 


4830 


CTCCCCAG GGCTAGCTACAACGA AGCGGGAG 


5818 


797 


/-1/~1/~l7\/~l7i /-l TV /1 TTt TP'TTP' TV P'P 1 

GGGAGAGA G UUCUGAGG 


4831 


o/~»t>/"»tv /"T tv tv /~» nm tv pt Pim tv tv tv nn tv mnm /-im /— 

CCTCAGAA GGCTAGCTACAACGA TCTCTCCC 


5819 


806 


TTTTP»TTP»TV /"If* TV T TT T/~>T TP1TV Pt Tk 

UUCUGAGG A UUGUCAGA 


4832 


T»^*"**rri/*l T\ /"I TV T\ rir*/ «Tt\ i^f II TV /^T\ TV /* t/^ TV /^m f** "TV IV TV 

TCTGACAA GGCTAGCTACAACGA CCTCAGAA 


5820 




T TP* A P* P* A T TT T P» TTP'AP'AP'P'P' 

UGAGGAUU G UCAGAGCC 


4833 


P'P'P'T'Pirp/'^TV PPPT A P<PvrtTV /-«TV TV /np T\ A TV rr\/~\r\rr\ri A 

GGCxCTGA GGCTAGCTACAACGA AATCCTCA 


5821 


815 


TTTTP'TTP'A /~t A P* PPT TP" A PV"1P» 

UUGUCAGA G CCUGACGC 


4834 


GCGTCAGG GGCTAGCTACAACGA TCTGACAA 


5822 


o o r\ 


TV PtTV P»P"»/*"«TTf' TV PV* P»P> P1TV OTT 

AGAGCCUG A CGCGCACU 


4835 


«prr>PPoinr» /"^i~« /-trp tv p«n tv iv tv yii*r tv /— i7\ /* /'tfi i/*-*m 

AGTGCGCG GGCTAGCTACAACGA CAGGCTCT 


5823 


o <d A 


7\PPPT7PAP P" P'P'P'A P*TTP'TT 

AGCCUGAC G CGCACUGU 


4836 


A PAPT'PPP PPPfT?\PPrp)\PA7iPP7\ P"TTP"7VP«P'P"T' 

ACAG X GCG GGC I AGC I ACAACGA Gx CAGGCT 


5824 


824 


PPTTPAPPP P" Pt A P*T TP*T T/TT T 

CCUGACGC G CACUGUCU 


4837 


AGACAGTG GGCTAGCTACAACGA GCGTCAGG 


5825 


826 


TTP» A PiPtPlOP" A /-IT TP'TTP'T T/~«T T 

UGACGCGC A CUGUCUGU 


4838 


TV P» TV P* TV /""I TV /~i /TPrpirnTV /lOmTV /"ITV Tv ^i/-itv /-1 #— 1/-1 r^^-i rxi f"1 T\ 

ACAGACAG GGCTAGCTACAACGA GCGCGTCA 


5826 


829 


CGCGCACU G UCUGUGCC 


4839 


GGCACAGA GGCTAGCTACAACGA AGTGCGCG 


5827 


833 


CACUGUCU G UGCCGGUG 


4840 


TV / m */^f~**/~1 T\ /IfT^ TV P,m "JV i - 1 1V TV TV TV Tv TV m/*1 

CACCGGCA GGCTAGCTACAACGA AGACAGTG 


5828 


835 


CUGUCUGU G CCGGUGGC 


4841 


GCCACCGG GGCTAGCTACAACGA ACAGACAG 


5829 


839 


/"TTT/"TTT/"V/"*'I/1/~I /*t T T/~1 /~1 /~1T T/~1T T/**1 

CUGUGCCG G UGGCUGUG 


4842 


CACAGCCA GGCTAGCTACAACGA CGGCACAG 


5830 


842 


tlOPPP/*>TT/*l /->TT/~1TTl""l/-1/-t/""l 

UGCCGGUG G CUGUGCCC 


4843 


GGGCACAG GGCTAGCTACAACGA CACCGGCA 


5831 


845 


CGGUGGCU G UGCCCGCU 


4844 


AGCGGGCA GGCTAGCTACAACGA AGCCACCG 


5832 


847 


GUGGCUGU G CCCGCUGC 


4845 


GCAGCGGG GGCTAGCTACAACGA ACAGCCAC 


5833 


851 


CUGUGCCC G CUGCAAGG 


4846 


CCTTGCAG GGCTAGCTACAACGA GGGCACAG 


5834 


854 


UGCCCGCU G CAAGGGGC 


4847 


GCCCCTTG GGCTAGCTACAACGA AGCGGGCA 


5835 


861 


UGCAAGGG G CCACUGCC 


4848 


GGCAGTGG GGCTAGCTACAACGA CCCTTGCA 


5836 


864 


AAGGGGCC A CUGCCCAC 


4849 


GTGGGCAG GGCTAGCTACAACGA GGCCCCTT 


5837 


867 


GGGCCACU G CCCACUGA 


4850 


TCAGTGGG GGCTAGCTACAACGA AGTGGCCC 


5838 


871 


CACUGCCC A CUGACUGC 


4851 


GCAGTCAG GGCTAGCTACAACGA GGGCAGTG 


5839 


875 


GCCCACUG A CUGCUGCC 


4852 


GGCAGCAG GGCTAGCTACAACGA CAGTGGGC 


5840 


878 


CACUGACU G CUGCCAUG 


4853 


CATGGCAG GGCTAGCTACAACGA AGTCAGTG 


5841 


881 


UGACUGCU G CCAUGAGC 


4854 


GCTCATGG GGCTAGCTACAACGA AGCAGTCA 


5842 


884 


CUGCUGCC A UGAGCAGU 


4855 


ACTGCTCA GGCTAGCTACAACGA GGCAGCAG 


5843 


888 


UGCCAUGA G CAGUGUGC 


4856 


GCACACTG GGCTAGCTACAACGA TCATGGCA 


5844 


891 


CAUGAGCA G UGUGCUGC 


4857 


GCAGCACA GGCTAGCTACAACGA TGCTCATG 


5845 


893 


UGAGCAGU G UGCUGCCG 


4858 


CGGCAGCA GGCTAGCTACAACGA ACTGCTCA 


5846 
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895 


AGCAGUGU G CUGCCGGC 


4859 


GCCGGCAG GGCTAGCTACAACGA ACACTGCT 


5847 


898 


AGUGUGCU G CCGGCUGC 


4860 


GCAGCCGG GGCTAGCIACAACGA AGCACACI 


5848 


O A O 

yuz 


TTrrmp^pA p 1 p^T^P'^»AP 1 P'P , 
UGCUGCCG G CUGCACGG 


4861 


PPPIVPTiP PP PT7i PPTTi PAH PP 7\ PPPPTiPPTi 

CCGJajCAG LiGClAGL,iACAAL.GA L.GGCAGUA 


5849 


905 


UGL.CL>CjL. U G CACGGGL.L. 


4862 


/-ipip'/TVT'r 1 PPPT^PPTIi /"""A APPTt 7APPPPPP7A 

L7GCCCLj1Li uCaClALrClACAACLiA AL7LC0L1UA 
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jILLIIjALA VjOLIALLIALAALoA LlAIALoJ. 


5972 


X 1 ! ± £ t 


apaitphpa p PBTirppfn 

ALAULULA O LAULLLLL 


4985 


r y C K (T , (~'7\ r Vr K PPPTAPPTAPAAPPA TPAPATPT 
LoVjLLAxvj uuL 1 nuL 1 ALAALuA 1 brtbH ioi 


5973 


X*i X D 


attpt tpapp a ttppppppa 
AULULALL A ULLLLLLA 


4986 


TPPnrfP A OOOT" A OOT" A OA A OO A PPTP A O A T 1 

ILUjLjLLA VjvjLIAoLIALAALLA LLILALAx 


5974 


1/11 Q 


77pappa77P p nprrapar 
ULALLAUL L LLLLALAL 


4987 


OTO THPPP PPPT'APPTAPA APPA PATPPTPA 

LlolLLLo ooLxAoLxALAALLA LAloLxLA 


5975 


1 A 7 A 


attoooooo a r , ?\pppr7PP 

AUoooU.v5L A LALLLUoL 


4988 


OOAOOPT'O PPPTAPPTAPAAPPA PPPPPPAT 
LLALLL 1 L VsL»L 1 AvjL x ALAALoA LLLoLLAx 


5976 




(TT , r*r , 7S OA P OP 1 ! TP OOT TP 1 

LLLLLALA Vj LLULLLUL 


4989 


OAOOOAOO PPOT'A POT 1 A O A A PP A TPTPPrnr 1 

LALLLAbL ooL 1 AoL X ALAALCjA 1G1LLLLL 


5977 


X4J X 


PTipTi pnpt t pi OOT TO A OOTT 
LALALLLU L LLULALLU 


4990 


APPT^PAPO PP PTA O PTA O A A OO A AOOOTVTPP 

AbbivJAbb GLrLI AbL 1 ALAALGA AGGLxGXL 


5978 


1 A "5 £ 


OOT TP 1 OOT TP 1 A OOT TO A OOO 

LLULLLUL A LLULAGLL 


4991 


p«pi nmp A O O PPPTAPPTAPAAPPA PAPPP7\ PP 

LLjL 1 GALL LGL 1 Abt x ALAALLA LALGLALL 


5979 


T A A O 


77P , ap i P , T7P , a r* ^ , ^'^f^^ , ^\^^^^ , ^ , 
UIjALLULA Vj LvjULUULL 


4992 


PPAAPAPP PPPTAPPTAPAAPPA TPAPPTPA 

LLAALALL GLL X AGL 1 ALAALLjA X bAbb 1 LA 


5980 


T A A A 
1444 


APPTTPAPP f~* T TOT TT TP'P" A P 1 

ALLULALjL ULUULLALj 


4993 


PTPPAAPA PPPTAPPTAPAAPPA PPTPAPPT 

L1GGAAGA GGLI AGLIALAALGA GLIGAGGI 


5981 


1 /l ca 
14r>4 


PITTTPPAPA A PPITPP A AO 

LUULLAIjA A LLUIjLAAvj 


4994 


PTTPPAPP PPPTAPPT A PA A PPA TPTPPAAP 

CTxGCAGG GGCTAGCTACAACGA TCTGGAAG 


5982 


1 ACQ 

l4bo 


OAOAAOOTT O 07\7\OTTAATT 

LAtaAALLU L LAALjUAAU 


4995 


ATTACTTG GGCTAGCTACAACGA AGGTTCTG 


5983 


1 A £0 


A 007TOOA A O TTA ATTOOOO 

ALLUoLAA la UAAULLLjo 


4996 


PPPPATTA PPPTAPPTAPAAPPA TTOOAOOT 

LLGLA11A GGL1 ALL i ALAALLA IxLLALLl 


5984 




1TOOAAOT7A A TTOOOOOOA 

U<jLAAL>UA A U L LLruLiLiA 


4997 


rpppppp/~i t\ PPPTAPPTAPAAPPA TAOTTPPA 

TLLLLGGA GGCTAGCTACAACGA TACTTGCA 


5985 


x*± / J 


ATTOOOOPIO A OO A ATTTTOT T 
AULLLjoVjKj A LVjAAUULU 


4998 


APAATTPP PPPTAPPTAPAAPPA rTTT^/^T^ A T 

ALAA 1 1 LG LGL 1 AGL 1 ALAALGA LLLLGGA 1 


5986 


1 AT7 

14 / / 


OOOOAOOA A TTTTOTTOOAO 

(jLjL»LiALLjA A UULUIjLAL 


4999 


PTPPAPAA PPPTAPPTAPAAPPA TPPrnnPf^P 

GTGCAGAA GGCTAGCTACAACGA TCGTCCCC 


5987 


7 A fl 9 


pp zv at777pt7 p pa0aat7oo 
LvjAAUUV-U Vj LALAAUoVj 


5000 


PPATTPTP PPPTAPPTAPAAPPA APAATTPP 

LLAX Iblb LGLX ALL X ALAALGA AGAAX1LG 


5988 


1 A Q A 

14 o4 


AATTTTOT700 A O A A T TOO OO 

AAUULULjL A LAAULKjLLt 


5001 


pipip^/-^ 7\ rpTO PPPTAPPTAPAAPPA PPAPAATT 

LGLLAxTG GGCxAGLIACAACGA GCAGAATT 


5989 


14 8 7 


UCUGCACA A UGGCGCCU 


5002 


AGGCGCCA GGCTAGCTACAACGA TGTGCAGA 


5990 


14 90 


ooaoaatto o oooottaott 
GLALAAUG O LGLLUALU 


5003 


APTAPPPP p /-tytm A P fit n A P A A pp A pa inmnrnrtn 

AGTAGGCG GGCTAGCTACAACGA CATTGTGC 


5991 


1492 


ACAAUGGC G CCUACUCG 


5004 


CGAGTAGG GGCTAGCTACAACGA GCCATTGT 


5992 


1496 


UGGCGCCU A CUCGCUGA 


5005 


TCAGCGAG GGCTAGCTACAACGA AGGCGCCA 


5993 


1500 


GCCUACUC G CUGACCCU 


5006 


AGGGTCAG GGCTAGCTACAACGA GAGTAGGC 


5994 


1504 


ACUCGCUG A CCCUGCAA 


5007 


TTGCAGGG GGCTAGCTACAACGA CAGCGAGT 


5995 


1509 


CUGACCCU G CAAGGGCU 


5008 


AGCCCTTG GGCTAGCTACAACGA AGGGTCAG 


5996 


1515 


CUGCAAGG G CUGGGCAU 


5009 


ATGCCCAG GGCTAGCTACAACGA CCTTGCAG 


5997 


1520 


AGGGCUGG G CAUCAGCU 


5010 


AGCTGATG GGCTAGCTACAACGA CCAGCCCT 


5998 


1522 


GGCUGGGC A UCAGCUGG 


5011 


CCAGCTGA GGCTAGCTACAACGA GCCCAGCC 


5999 


1526 


GGGCAUCA G CUGGCUGG 


5012 


CCAGCCAG GGCTAGCTACAACGA TGATGCCC 


6000 


1530 


AUCAGCUG G CUGGGGCU 


5013 | 


AGCCCCAG GGCTAGCTACAACGA CAGCTGAT 


6001 


1536 


UGGCUGGG G CUGCGCUC 


5014 


GAGCGCAG GGCTAGCTACAACGA CCCAGCCA 


6002 
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1539 


CUGGGGCU G CGCUCACU 


5015 


AGTGAGCG GGCTAGCTACAACGA AGCCCCAG 


6003 


1541 


PPPrT'TTPr' /~i frt TPi A P»TTP< A 

GGGGCUGC G CUCACUGA 


5016 


m/*i7v tv i Tt /*i priPiii\nnTiv/TR « nniv pipiTVPipipipipi 

TCAGTGAG GGCIAGCTACAACGA GCAGCCCC 


6004 


1545 


PITPPPPITP A r*T TP 1 A Cf^C* A 

CUGCGCUC A CUGAGGGA 


5017 


mfipnrn/-i 71 PPPTAPprpiknftTiPPTl P» A P* PiP 1 P» A P» 

1CCI- ICACj Uljl— J-AGL-IALAACGA GACjV^QjUAG 


6005 


1554 


pnnivnppTv a pittpipipip»api 
CUGAGGGA A CUGGGCAG 


5018 


/*imP(*in01\ Pi PiplpiTi 7\ PiPirrnv /"I TV 71 Plpl TV rnPiPiPirrt/-»TV /"l 

CTGCCCAG GGCIAGCTACAACGA TCCCTCAG 


6006 


1 rrn 


ripii > /-it tp»p' /—i piap'ttp'P'ap* 
GGAACUGG G CAbUbuAU 


5019 


GICCACIG GGCIAGCIACAACGA CCAGI ICC 


6007 


1562 


ACUGGGCA G UGGACUGG 


5020 


/-tpiTV /"~lrp/-(/-i7\ pppm»pi^fi/iji iii^nn TtpipipiprTv rtm 

CCAGTCCA GGCIAGCTACAACGA TGCCCAGT 


6008 


lbbo 


rrrvA p , ttp , P 1 a r>r tp'PT'p* r'TT 
GGLAGUGG A CUGGCCCU 


5021 


7\r-i/~t/-~>/-»/-i7\/^l pPpfPTiPP'PTiPTl T1PPT1 PPTi PTPPP 

AGGGCCAG GGCIAGCTACAACGA CCACTGCC 


6009 


1570 


P1TTP»P» A HTTP O P'P'PIT TP 1 AT TP1 

GUGGACUG G CCCUCAUC 


5022 


P7i rppiTy i~\r*r* PPPT1VPPrPI\P7lTlPP)\ PTAPTPPAP 

GATGAGGG GGCTAGCTACAACGA CAGTCCAC 


6010 


1576 


t T,r-(/--i/~i/-~i/-iT Tr~* a ttp , p , 7\p , p»7\tt 
UGGCCCUC A UCCACCAU 


5023 


TlfPPPnnPPTi ppiprpTyppmii p» 7. ppji Pi A P'P'O/^P' A 

AXGGTGGA GGCIAGCTACAACGA GAGGGCCA 


6011 


1580 


P»PtTTP»ATTP«P'» A P<PTA TT7\ 7\ P» A 

CCUCAUCC A CCAUAACA 


5024 


TGT1ATGG GGCTAGCTACAACGA GGATGAGG 


6012 


1583 


/T»TTP/*mPP "X TT7V IV P17V Ptplpl 

CAUCCACC A UAACACCC 


5025 


/*t/*i/*im^fT»niTV /^v/^i / 11 T 1 t\ ^irn tv ^ tv tv ^r*% tv /inwo/'i n m/n 

GGGTGTTA GGCTAGCTACAACGA GGTGGATG 


6013 


1586 


CCACCAUA A CACCCACC 


5026 


GGTGGGTG GGCTAGCTACAACGA TATGGTGG 


6014 


1588 


AC CAUAAC A CCCACCUC 


5027 


GAGGTGGG GGCTAGCTACAACGA GTTATGGT 


6015 


1592 


tttv ti p»tv pi pip* a nniTrTTP pit t 
UAACACCC A CCUCUGCU 


5028 


TV Pi /*i TV j*iiv /"*» pi P»PiPirn7v p^pwtiti Pi 71 Pi/"^ ti oPPmPmmn 

AGCAGAGG GGCTAGCTACAACGA GGGTGTTA 


6016 


1598 


r~\f~*T\. f~*f~r\ IP 1 T T p» OT TTTpipiTTPipi 

CCACCUCU G CUUCGUGC 


5029 


GCACGAAG GGCTAGCTACAACGA AGAGGTGG 


6017 


1603 


7 TPTT TP1 PIT TT TP! PI T TP1 PI TV PI TV P1PI 

UCUGCUUC G UGCACACG 


5030 


CGTGTGCA GGCTAGCTACAACGA GAAGCAGA 


6018 


1605 


UGCUUCGU G CACACGGU 


5031 


tv n/t/im/im/i /*ij^nfn tv /i /"wtttv ^t tv tv -n -*v tv tv /t 

ACCGTGTG GGCTAGCTACAACGA ACGAAGCA 


6019 


1607 


CUUCGUGC A CACGGUGC 


5032 


GCACCGTG GGCTAGCTACAACGA GCACGAAG 


6020 


1609 


UCGUGCAC A CGGUGCCC 


5033 


GGGCACCG GGCTAGCTACAACGA GTGCACGA 


6021 


1612 


UGCACACG G UGCCCUGG 


5034 


CCAGGGCA GGCTAGCTACAACGA CGTGTGCA 


6022 


1614 


CACACGGU G CCCUGGGA 


5035 


TCCCAGGG GGCTAGCTACAACGA ACCGTGTG 


6023 


1622 


/T /-% /I /TT T/~M~% r~* TV /"T/""*TV /I /Tf T/TTT 

GCCCUGGG A CCAGCUCU 


5036 


AGAGCTGG GGCTAGCTACAACGA CCCAGGGC 


6024 


1626 


TTP1P1P1 TV P1P17V /-1 ntirttTTtf T/l/"1 

UGGGACCA G CUCUUUCG 


5037 


/1/TTv tv tv /ttv /^i n/t/^irriTi n/im> /ii\ tv /*t*^i tv mn/im/r/inn 

CGAAAGAG GGCTAGCTACAACGA TGGTCCCA 


6025 


1637 


CUUUCGGA A CCCGCACC 


5038 


GGTGCGGG GGCTAGCTACAACGA TCCGAAAG 


6026 


1641 


CGGAACCC G CACCAAGC 


5039 


GCTTGGTG GGCTAGCTACAACGA GGGTTCCG 


6027 


1643 


GAACCCGC A CCAAGCUC 


5040 


GAGCTTGG GGCTAGCTACAACGA GCGGGTTC 


6028 


1648 


CGCACCAA G CUCUGCUC 


5041 


GAGCAGAG GGCTAGCTACAACGA TTGGTGCG 


6029 


1653 


CAAGCUCU G CUCCACAC 


5042 


GTGTGGAG GGCTAGCTACAACGA AGAGCTTG 


6030 


1658 


L/CUGCUCC A CACUGCCA 


5043 


TGGCAGTG GGCTAGCTACAACGA GGAGCAGA 


6031 


1660 


T T^l ^TF T/~l /T TV /"I TV /"1TT/*t /"T^*TV TV /T 

UGCUCCAC A CUGCCAAC 


5044 


GTTGGCAG GGCTAGCTACAACGA GTGGAGCA 


6032 


1663 


UCCACACU G CCAACCGG 


5045 


CCGGTTGG GGCTAGCTACAACGA AGTGTGGA 


6033 


1667 


CACUGCCA A CCGGCCAG 


5046 


CTGGCCGG GGCTAGCTACAACGA TGGCAGTG 


6034 


1671 


GCCAACCG G CCAGAGGA 


5047 


TCCTCTGG GGCTAGCTACAACGA CGGTTGGC 


6035 


1679 


GCCAGAGG A CGAGUGUG 


5048 


CACACTCG GGCTAGCTACAACGA CCTCTGGC 


6036 


1683 


GAGGACGA G UGUGUGGG 


5049 


CCCACACA GGCTAGCTACAACGA TCGTCCTC 


6037 


1685 


GGACGAGU G UGUGGGCG 


5050 


CGCCCACA GGCTAGCTACAACGA ACTCGTCC 


6038 


1687 


ACGAGUGU G UGGGCGAG 


5051 


CTCGCCCA GGCTAGCTACAACGA ACACTCGT 


6039 


1691 


GUGUGUGG G CGAGGGCC 


5052 


GGCCCTCG GGCTAGCTACAACGA CCACACAC 


6040 


1697 


GGGCGAGG G CCUGGCCU 


5053 


AGGCCAGG GGCTAGCTACAACGA CCTCGCCC 


6041 


1702 


AGGGCCUG G CCUGCCAC 


5054 


GTGGCAGG GGCTAGCTACAACGA CAGGCCCT 


6042 


1706 


CCUGGCCU G CCACCAGC 


5055 


GCTGGTGG GGCTAGCTACAACGA AGGCCAGG 


6043 


1709 


GGCCUGCC A CCAGCUGU 


5056 


ACAGCTGG GGCTAGCTACAACGA GGCAGGCC 


6044 


1713 


T TiT ^T^t TV /-1/~»TV /~\ /*lTT/^TT/T/*»/*1^V 

UGCCACCA G CUGUGCGC 


5057 


GCGCACAG GGCTAGCTACAACGA TGGTGGCA 


6045 


1716 


CACCAGCU G UGCGCCCG 


5058 


CGGGCGCA GGCTAGCTACAACGA AGCTGGTG 


6046 


1718 


CCAGCUGU G CGCCCGAG 


5059 


CTCGGGCG GGCTAGCTACAACGA ACAGCTGG 


6047 


1720 


AGCUGUGC G CCCGAGGG 


5060 


CCCTCGGG GGCTAGCTACAACGA GCACAGCT 


6048 


1728 


GCCCGAGG G CACUGCUG 


5061 


CAGCAGTG GGCTAGCTACAACGA CCTCGGGC 


6049 


1730 


CCGAGGGC A CUGCUGGG 


5062 


CCCAGCAG GGCTAGCTACAACGA GCCCTCGG 


6050 


1733 


AGGGCACU G CUGGGGUC 


5063 


GACCCCAG GGCTAGCTACAACGA AGTGCCCT 


6051 


1739 


CUGCUGGG G UCCAGGGC 


5064 


GCCCTGGA GGCTAGCTACAACGA CCCAGCAG 


6052 


1746 


GGUCCAGG G CCCACCCA 


5065 


TGGGTGGG GGCTAGCTACAACGA CCTGGACC 


6053 


1750 


CAGGGCCC A CCCAGUGU 


5066 


ACACTGGG GGCTAGCTACAACGA GGGCCCTG 


6054 



WO 02/097114 



PCT/US02/16840 



141 



1755 


pPpta PP/"»tv P nnTT^TT/*ii» Tv 

CCCACCCA G UGUGUCAA 


5067 


TTGAGACA GGCIAGLTACAACGA TGGGTGGG 


6055 


1 TCI 


papppaptt p ttpttp tv. a ptt 
IJALL.L-AGU G UbULiiALU 


5068 


AG 1 1 oAGA LjGv, I AbL 1 AGAAGGA AL 1 GGG 1 G 


6056 


1 7CO 


tA-v-AoUGU G UGAAL.UGG 


5069 


P , P 1 7\P"T ,r PP , A P'P'P"P7A/~ , P , *T»n P> A 7*. P*P« A A P 1 A P*T'/^P'P' 

GCAGITGA GGCUAGCrrACAACGA ACACTGGG 


6057 


i/bj 


pt tpttpttptv a pt tp pa ppp 
GUGUGUGA A GUGGAoGL. 


5070 


/—i /"-I P"Tipt fi TV PPPT'APPln7>PA7vPP7\ fPP 1 A PiA P<A P> 

GGG1GGAG GGG1 AGG1 AGAAGGA 1GAGAGAG 


6058 


i 'ob 


ttpttptaaptt p papppaptt 


5071 


SPTPPPTP PPPTAPPTTl P*A A P 1 P« A APTTPAPA 

AL.ivjoL.lvj 00LIA0L.IALAALGA AollGAL-A 


6059 


1 TCQ 


paapitppa p pprpttitpp 


5072 


P'P 7i Ti P^PP^P 1 PPPTAPPTAPA AP"P A TPPRPTTV 1 

IjbAAv, 1 GG 1 AoL. 1 AL-AALVjA 1 bLrtu X 1 G 


6060 


1 777 
JL f f 3 


TTPPAPpPA P ITTTPPTTTTPP 


5073 


PPTATiPPAA PPPTAPPTAPA APPA TP , P»P"T , P'P , A 

1 L0AA00AA butlAbHALAALvjA 1GGG1GLA 


6061 


x / a 


PPTTTTPPPP P PPRPPAPTT 


5074 


a PTPPTPP PPPTAPPTAPA AP'P'A PPPPAAPP 

AL. 1 1 oG but 1 AbL I Av-AALLtA LLLbrAAGG 


6062 


1 7Q1 


nnrnannia p ttpppttppa. 


5075 


TPP^PPPA ppptapptapa appa r vr > r ,T vrT*r t r > 
IL-LALoGA bbL i. Abb 1 ALAAbbA ILLlbbbb 


6063 


j. / 


fPAPPArn p ppttppapp 
L.LAvjoALtU Vj COUOLtAGG 


5076 


L.GJ.GLAGG GGGlAGGiAuAALGA ALICCrGG 


6064 


1 / 3j 


appb.pt tpp p ttpp app a 7\ 
AooAoUGL, o UooAoGAA 


5077 


TTPPTPPA PPPT APPTA P A A PP A PP7\ PTPPT 

llbblbbA GGG1AGGJ.AGAAGGA GGALIGG1 


6065 




PT TPP 7\ PP A A T TP PPOT\ PTT 

GUGGAGGA A UGGGGAGU 


5078 


tv p^ti/^/™»<*^/*«tv PPPffl Tv /-»pititv /^ T\ tv nn TV tTIP/ >l n^l^l TV /*i 

ACTCGGCA GGCTAGCTACAACGA TCCTCCAC 


6066 


XoUd 


PPAPHA Tv TT P PPP A PT TAP 

GGAGGAAU G GGGAGUAG 


5079 


PT1\P»Pr<PP /-tPPT>TV PPTIJV /^TV TV 7V TVTIT>/*» 1^1 »n /*•/*■» 

G1AGXCGG GGCIAGCrrACAACGA ATTCCTCC 


6067 


-lo J. U 


A 7\ T TP PPP 7V P TT7\ PTTPPTt P 

AAUGGLGA G UACUGCAG 


5080 


CTGCAGTA GGCTAGCrrACAACGA TCGGCATT 


6068 


18X2 


UGCCoAGu A GUGLAGGG 


5081 


CCCTGCAG GGCTAGCTACAACGA ACTCGGCA 


6069 


1815 


pptv ptttv /~tTT p /*nvr»or</^nTT 

CGAGUACU G CAGGGGCU 


5082 


AGCCCCTG GGCTAGCTACAACGA AGTACTCG 


6070 


1821 


CUGCAGGG G CUCCCCAG 


5083 


CTGGGGAG GGCTAGCTACAACGA CCCTGCAG 


6071 




pppt* ppptv p iTHTT^irnw n 

CCCAGGGA G UAUGUGAA 


5084 


TTCACATA GGCTAGCTACAACGA TCCCTGGG 


6072 


1835 


CAGGGAGU A UGUGAAUG 


5085 


CATTCACA GGCTAGCTACAACGA ACTCCCTG 


6073 


1837 


GGGAGUAU G UGAAUGCC 


5086 


GGCATTCA GGCTAGCTACAACGA ATACTCCC 


6074 


1841 


GUAUGUGA A UGCCAGGC 


5087 


GCCTGGCA GGCTAGCTACAACGA TCACATAC 


6075 


1843 


•A T TPT TP TV TV T Y P PP TV P P P TV P 

AUGUGAAU G CCAGGCAC 


5088 


GTGCCTGG GGCTAGCTACAACGA ATTCACAT 


6076 


1848 


7\ TV T T/"1 f*y TV /*t TV /"l T T/"l Y T7 TT T 

AAUGCCAG G CACUGUuU 


5089 


AAACAGTG GGCTAGCTACAACGA CTGGCATT 


6077 


lob U 


TTPPPi\nrir' 71 pttpttttttpp 
UGLCAGGC A CUGUUUGC 


5090 


GCAAACAG GGCTAGCTACAACGA GCCTGGCA 


6078 


1 Q C T 

lab3 


/-tTi pp p Tv /-ITT p TTTTIlPPrT'TT 

CAGGCACU G UUUGCCGU 


5091 


Tv /~i /^i t\ tv tv nnnm tv nm is ^itv tv /T^t tv tv r^mr* /— t/^m y^i 

ACGGCAAA GGCTAGCTACAACGA AGTGCCTG 


6079 


1 OCT 


PAPTTPTTTTTT P PPPI TPPP7V 


5092 


nr«/*iO/^7v r*t~in /^/-»/~i»t»tv nnrnn /itv tv /m/t»v 7v tv ti i^tv /im/i 

TGGCACGG GGCTAGCTACAACGA AAACAGTG 


6080 


1 Q C A 

XobU 


TTPTTTTTTPPP P TTPPP A PPP 


5093 


GGGTGGCA GGCTAGCTACAACGA GGCAAACA 


6081 


lot)/ 


TTTTTTPPPPTT P PP1\PP PT TP 

UUUIjtLbU G GCAGGGUG 


5094 


/"i/'i/Tn/'t/t nnrrpTi nrifnn <^iti t> tv tv /-«^/-i/~»tv tv tv 

CAGGGTGG GGCTAGCTACAACGA ACGGCAAA 


6082 


IDOj 


PPPPTTPPP Tv PPPTTPAPTT 


5095 


7v /-MT 1 / - ' tv /-»/-! /~i /-i/~«/""!rpTV /^tpirpiv /*^tv tv n^»Tv nnniv /^/-i/-^ /-^ 

AG1GAGGG GGG X AGLTACAACGA GGCACGGC 


6083 


i mo 
J. o / z 


CTi CTT^l TP" 7\ P TTPTTPAPPP 


5096 


GGLXGACA GGC1 AGLTACAACGA TCAGGGTG 


6084 


1 0 "7 A 
lo / 1 


PPPITPAPTT P TTPAPPP'P'P' 


5097 


PP PP PI7T/"t TV PPPTPTv PPITITV PTV TV PP TV TVPITIPTIPPP 

GGGGCTGA GGCTAGCTACAACGA ACTCAGGG 


6085 


1878 


GAGUGUCA G CCCCAGAA 


5098 


TTCTGGGG GGCTAGCTACAACGA TGACACTC 


6086 


1886 


GCCCCAGA A UGGCUCAG 


5099 


CTGAGCCA GGCTAGCTACAACGA TCTGGGGC 


6087 


1889 


CCAGAAUG G CTJCAGUGA 


5100 


TCACTGAG GGCTAGCTACAACGA CATTCTGG 


6088 


1894 


AUGGCUCA G UGACCUGU 


5101 


ACAGGTCA GGCTAGCTACAACGA TGAGCCAT 


6089 


1897 


GCUCAGUG A CCUGUUUU 


5102 


AAAACAGG GGCTAGCTACAACGA CACTGAGC 


6090 


1901 


AGUGACCU G UUUUGGAC 


5103 


GTCCAAAA GGCTAGCTACAACGA AGGTCACT 


6091 


1908 


UGUUUUGG A CCGGAGGC 


5104 


GCCTCCGG GGCTAGCTACAACGA CCAAAACA 


6092 


1915 


GACCGGAG G CUGACCAG 


5105 


CTGGTCAG GGCTAGCTACAACGA CTCCGGTC 


6093 


1919 


GGAGGCUG A CCAGUGUG 


5106 


CACACTGG GGCTAGCTACAACGA CAGCCTCC 


6094 


1923 


GCUGACCA G UGUGUGGC 


5107 


GCCACACA GGCTAGCTACAACGA TGGTCAGC 


6095 


1925 


UGACCAGU G UGUGGCCU 


5108 


AGGCCACA GGCTAGCTACAACGA ACTGGTCA 


6096 


1927 


TV P1/"17V OttPT T O TTV»/""»r"«/"^YT>'"1TT 

ACCAGUGU G UGGCCUGU 


5109 


ACAGGCCA GGCTAGCTACAACGA ACACTGGT 


6097 


1930 


AGUGUGUG G CCUGUGCC 


5110 


GGCACAGG GGCTAGCTACAACGA CACACACT 


6098 


1934 


UGUGGCCU G UGCCCACU 


5111 


AGTGGGCA GGCTAGCTACAACGA AGGCCACA 


6099 


1936 


UGGCCUGU G CCCACUAU 


5112 


ATAGTGGG GGCTAGCTACAACGA ACAGGCCA 


6100 


1940 


CUGUGCCC A CUAUAAGG 


5113 


CCTTATAG GGCTAGCTACAACGA GGGCACAG 


6101 


1943 


UGCCCACU A UAAGGACC 


5114 


GGTCCTTA GGCTAGCTACAACGA AGTGGGCA 


6102 


1949 


CUAUAAGG A CCCUCCCU 


5115 


AGGGAGGG GGCTAGCTACAACGA CCTTATAG 


6103 


1961 


UCCCUUCU G CGUGGCCC 


5116 


GGGCCACG GGCTAGCTACAACGA AGAAGGGA 


6104 


1963 


CCUUCUGC G UGGCCCGC 


5117 


GCGGGCCA GGCTAGCTACAACGA GCAGAAGG 


6105 


1966 


UCUGCGUG G CCCGCUGC 


5118 


GCAGCGGG GGCTAGCTACAACGA CACGCAGA 


6106 
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1970 


CGUGGCCC G CUGCCCCA 


5119 


TGGGGCAG GGCTAGCTACAACGA GGGCCACG 


6107 


1973 


GGCCCGCU G CCCCAGCG 


5120 


CGCTGGGG GGCTAGCTACAACGA AGCGGGCC 


6108 


1979 


CUGCCCCA G CGGUGUGA 


5121 


TPHP^nrip PPPT'APPmTv P7v a pp a T>PPP r~> /"< 7\ r~< 

1CACACCG GGCTAGCTACAACGA TGGGGCAG 


6109 


1982 


pprnAr" pp p t tpt tp a 7\ tv p 

CCCCAGCG G UGUGAAAC 


5122 


prprrwypTV PA PP PT< A /"^ Pm TV /"? TV TV A PP/TT»PPPP 

GTTTCACA GGCTAGCTACAACGA CGCTGGGG 


6110 


1984 


PPA PA/^OTT P T TP A TV A PP*T T 

CCAGCGGU g ugaaaccu 


5123 


APPT"T"T'PA ppprpTV p prpTV PTV A PPA A PPPPT»PP 

AGGIITCA GGCTAGCTACAACGA ACCGCTGG 


6111 


1989 


PPT TPT TP 7\ 71 7* PPTTPA ppt T 

GGUGUGAA A ccugaccu 


5124 


■ftPPTPAPP ppp»Ti7vpprpA P A A PP A rprp/iTi P7V PP 

AGG I CAGG GGCTAGCTACAACGA 1 I CACACC 


6112 


1994 


P A A A PPT TP A PPTTPT TPPTT 


5125 


APPAPAPP pppryiTV ppT»7\ PA A PP A PTirPrpTTP 
AvjtjAvjrALilj vjVjL,1AvjtV_1 ACAACoA L-Avjvj 1 J. I C 


6113 


2003 


PPITPTTPPTT A PMTPPPP7\ 

CLIK-UCCU A CAUGl-CCA 


5126 


T'PPPPATTP PPPT^TV P Pm TV P TV A PP A APPAPAPP 

HjGGCAIG GGCIAvjLTACAACGA AGGAGAGG 


6114 


2005 


T TPI Tri/-rr T A f"» A TTPPPP7VTTP 

UCUCCUAC A UbCCCAUC 


5127 


P A TPPP PTV PPPT*APP*T»A P A A PP A PT»APPT\PA 

GATGGGCA GGCTAGCTACAACGA GTAGGAGA 


6115 


2007 


T TPPTTA P AIT P PPPATTPTTP 


5128 


PAPA TP PP ppprrtTv pprrtTV PA A PPA AT*P«T»APPA 

CAGAIGGG GGCTAGCTACAACGA ATGTAGGA 


6116 


£ U J. 1 


7\ PSTTPPPP A T TPT 7PP A A P 


5129 


PT"TPP A P 7\ PP PT 1 A P prpTV p A A PP A rT t r>/^1\rrr*'T 

CI !L.CAijrA VjIjCIAVjCIACAACGA GbGCAlGJ. 


6117 


O Pi T Q 


A T TP 1 ! TP 1 P A A P TTTTT7PPAPA 

AU C U IjCjAA G U U U C L AQj a 


5130 


TPTPPAAA pp pirn A pprp A P A A PP A rPTPPTV PAT" 

1 L 1 GGAAA GGL 1 AGC TAG AACGA 1 1 CCAGAT 


6118 


zu^ / 


P 1 ^TTTrT^ , p , AP , a t7P , 7ap , p i app 


5131 


PPT PPT 1 PA PPPT A PPT A P A A PP A P'T'PP A A A P 

bblv-UILA bbUlAvjUlACAACGA L-xbGAAAC 


6119 


2036 


t TO A PP A pp P PP P A T TP PP 

UGAGGAGG G CGCAUGCC 


5132 


PPPAT»PPP PPPT APPTIA PA A PP A pprppprpPTV 

GGCAIGCG GGCTAGCTACAACGA CCTCCTCA 


6120 


/Uo o 


APPAP»PPP p P71TIPPP7IP 

AvjVjACjCjCjC G CAUCjCCACj 


5133 


PTPPPST1P PPP^T 1 APPTrpTl P A A PP A P^ 'PPPT»PPT» 

CIGGCATG GGCTAGCTACAACGA GCCCTCCT 


6121 


204 0 


PAPPPPPP A T7PPPAPPP 

GAGGGCGC A UGCCAGCC 


5134 


PPPTO^OH PPPTAPPTA PA A PPA PPPPPPrnP 

GGCTGGCA GGCTAGCTACAACGA GCGCCCTC 


6122 


2042 


GGGCGCAU G CCAGCCUU 


5135 


AAGGCTGG GGCTAGCTACAACGA ATGCGCCC 


6123 


2046 


GCAUGCCA G CCUUGCCC 


5136 


ft /1 f*T\ tv f«ft ft ft ft rn tv nnmTt ft t\ tv ft ft tv mfi/-i/-i>i mftfi 

GGGCAAGG GGCTAGCTACAACGA TGGCATGC 


6124 


2051 


CCAGCCUU G CCCCAUCA 


5137 


TGATGGGG GGCTAGCTACAACGA AAGGCTGG 


6125 


2056 


CUUGCCCC A UCAACUGC 


5138 


GCAGTTGA GGCTAGCTACAACGA GGGGCAAG 


6126 


2060 


CCCCAUCA A CUGCACCC 


5139 


GGGTGCAG GGCTAGCTACAACGA TGATGGGG 


6127 


2063 


CAUCAACU G CACCCACU 


5140 


AGTGGGTG GGCTAGCTACAACGA AGTTGATG 


6128 


2065 


UCAACUGC A CCCACUCC 


5141 


GGAGTGGG GGCTAGCTACAACGA GCAGTTGA 


6129 


2069 


CUGCACCC A CUCCUGUG 


5142 


CACAGGAG GGCTAGCTACAACGA GGGTGCAG 


6130 


2075 


CCACUCCU G UGUGGACC 


5143 


GGTCCACA GGCTAGCTACAACGA AGGAGTGG 


6131 


2077 


ACUCCUGU G UGGACCUG 


5144 


CAGGTCCA GGCTAGCTACAACGA ACAGGAGT 


6132 


2081 


CUGUGUGG A CCUGGAUG 


5145 


CATC CAGG GGCTAGCTACAACGA CCACACAG 


6133 


2087 


fl fl tv fT ftT T/^l/^t TV TT/1 TV /*1TV TV f*t/T 

GGACCUGG A UGACAAGG 


5146 


CCTTGTCA GGCTAGCTACAACGA CCAGGTCC 


6134 


2090 


PPTTPP TV TV OHHP/*>rtrtTT 

CCUGGAUG A CAAGGGCU 


5147 


AGCCCTTG GGCTAGCTACAACGA CATCCAGG 


6135 


2096 


UGACAAGG G CUGCCCCG 


5148 


CGGGGCAG GGCTAGCTACAACGA CCTTGTCA 


6136 


2099 


CAAGGGCU G CCCCGCCG 


5149 


CGGCGGGG GGCTAGCTACAACGA AGCCCTTG 


6137 


21 04 


GCUGCCCC G CCGAGCAG 


5150 


fiirift fim f»fi ft /—1 / srnTi ft ftrn t» ft tv tv ft/*t t\ fiftftft ftiv ftf* 

CTGCTCGG GGCTAGCTACAACGA GGGGCAGC 


6138 


2109 


ftfl flft ftfl Tv ft ft >\ 7\ fl t\ /-« /"I 

CCCGCCGA G CAGAGAGC 


5151 


GCTCTCTG GGCTAGCTACAACGA TCGGCGGG 


6139 


2116 


AGCAGAGA G CCAGCCCU 


5152 


AGGGCTGG GGCTAGCTACAACGA TCTCTGCT 


6140 


2120 


GAGAGCCA G CCCUCUGA 


5153 


TCAGAGGG GGCTAGCTACAACGA TGGCTCTC 


6141 


2128 


GCCCUCUG A CGUCCAUC 


5154 


GATGGACG GGCTAGCTACAACGA CAQAGGGC 


6142 


2130 


CCUCUGAC G UCCAUCAU 


5155 


ATGATGGA GGCTAGCTACAACGA GTCAGAGG 


6143 


2134 


UGACGUCC A UCAUCUCU 


5156 


AGAGATGA GGCTAGCTACAACGA GGACGTCA 


6144 


2137 


CGUCCAUC A UCUCUGCG 


5157 


CGCAGAGA GGCTAGCTACAACGA GATGGACG 


6145 


2143 


UCAUCUCU G CGGUGGUU 


5158 


AACCACCG GGCTAGCTACAACGA AGAGATGA 


6146 


2146 


UCUCUGCG G UGGUUGGC 


5159 


GCCAACCA GGCTAGCTACAACGA CGCAGAGA 


6147 


2149 


CUGCGGUG G UUGGCAUU 


5160 


AATGCCAA GGCTAGCTACAACGA CACCGCAG 


6148 


2153 


GGUGGUUG G CAUUCUGC 


5161 


GCAGAATG GGCTAGCTACAACGA CAACCACC 


6149 


2155 


UGGUUGGC A UUCUGCUG 


5162 


CAGCAGAA GGCTAGCTACAACGA GCCAACCA 


6150 


2160 


GGCAUUCU G CUGGUCGU 


5163 


ACGACCAG GGCTAGCTACAACGA AGAATGCC 


6151 


2164 


UUCUGCUG G UCGUGGUC 


5164 


GACCACGA GGCTAGCTACAACGA CAGCAGAA 


6152 


2167 


UGCUGGUC G UGGUCUUG 


5165 


CAAGACCA GGCTAGCTACAACGA GACCAGCA 


6153 


2170 


UGGUCGUG G UCUUGGGG 


5166 


CCCCAAGA GGCTAGCTACAACGA CACGACCA 


6154 


2179 


UCUUGGGG G UGGUCUUU 


5167 


AAAGACCA GGCTAGCTACAACGA CCCCAAGA 


6155 


2182 


UGGGGGUG G UCUUUGGG 


5168 


CCCAAAGA GGCTAGCTACAACGA CACCCCCA 


6156 


2191 


UCUUUGGG A UCCUCAUC 


5169 


GATGAGGA GGCTAGCTACAACGA CCCAAAGA 


6157 


2197 


GGAUCCUC A UCAAGCGA 


5170 


TCGCTTGA GGCTAGCTACAACGA GAGGATCC 


6158 
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2202 


CUCAUCAA G CGACGGCA 


5171 


fTiOOOOTTOO oOOTi A OOt»tv Otv TV OO A T""PO A TO A O 

TGCCGTCG GGCTAGCTACAACGA TTGATGAG 


6159 


2205 


AUCAAGCG A CGGCAGCA 


5172 


rrvw »iM/*t/~iO/-» f* O OTT TV O nm« r*\ n n r*i~* tv /^OOnrrpOTv m 

TGCTGCCG GGCTAGCTACAACGA CGCTTGAT 


6160 


2208 


7\ A O OO A OO O OAOOAOAA 

AAGCGACG G CAGCAGAA 


5173 


mmpmppmp PPPTAPprpTi OA A OOA OOT»OOO r PT' 

IICIGCIG GGCIAGCIACAACGA CGJ.CGCI.1 


6161 


ZZ11 


007\ O OA OA TV O A T T 

CGACGGCA G CAGAAGAU 


5174 


»mpmmpmp pppm a OOT»A O A A OO A nr , OOr , O r POr 1 

AICAJ.C1G GGCiAGCIACAACGA 1GCCG1CG 


6162 


ZZ la 


A 007V O A 7\/-i A T T^/^^/ -1 7\ AO 

AGCAGAAG A UCCGGAAG 


5175 


PTTPPPPTi PPPTSPPTiTl OA A OOA OT TOT OOT* 

CTTCCGGA GGCIAGCTACAACGA CIILIGCT 


6163 


2226 


AUCCGGAA G UACACGAU 


5176 


7v fppPTPTTi OOOfA OOfPA O A A OOA TTPPPPUT 

A1CG1GIA GGCJLAGCTACAACGA IICLGGA1 


6164 




CCGGAAGU A CACGAUGC 


5177 . 


007\ T'OOT'O OO PT A O Orp 7V O 7V TV OO A TV OT'T'OOOO 

GCATCGTG GGCTAGCTACAACGA ACTTCCGG 


6165 




ppji A OTTA O A OO A T to oo o 

GGAAGUAC A CGAUGCGG 


5178 


/-i(~>/-~t/^i A T^OO OOOTAOOTA O A A OO A OTA PTTPP 

CCGCA1CG GGCIAGCIACAACGA GIAL1ICC 


6166 


O O O 1 


AGUACACG A UGLGGAGA 


5179 


rp/' w I <0 OO O TV OOOTTTV OOrriTV O TV TV OOTV OOT>0 Ti 7V /"irn 

TCTCCGCA GGCIAGCTACAACGA CGTGTACT 


6167 


zzo o 


TTA OA 00 ATT O OOOAOAOTT 

U ACACGAU G CGGAGAC U 


5180 


7\ /~^rr\f~trrif^/~\/-^ OOOTA OOT A O A A OO A RTPPTPTA 

AGTCTCCG GGCTAGCTACAACGA ATCGTGTA 


6168 




A7TOOOOAO A OTTOOTTOOA 

AUGGGGAG A CUGCUGCA 


5181 


rpooTvooivo ooot>tv oomTv *""»tv tv no tv irnnprmivfii 

TGCAGCAG GGCTAGCTACAACGA CTCCGCAT 


6169 


2244 


OOOAOAOTT O OTTOOAOOA 

LGGAGAGU G CUGCAGGA 


5182 


rpOOTiOOAO OO OT A OOT A OA A OO A AOT'O'POOO 

TCCTGCAG GGCIAGLTACAACGA AGTCTCCG 


6170 


2247 


AOAOTTOOTT o oaooaaao 

AGACUGCU G CAGGAAAC 


5183 


ot i iii hi ioorno ooomTv oormv mv tv r\r-* t\ unrnvrirnnm 

GTTTCCTG GGCTAGCTACAACGA AGCAGTCT 


6171 


2254 


TTO/"ITV ^"»/"tTV TV TV OOO TV OOT T/"1 

UGCAGGAA A CGGAGCUG 


5184 


CAGCTCCG GGCTAGCTACAACGA TTCCTGCA 


6172 




OA A A OOO A O OI TOOT TOO A 

GAAAGGGA G GUGGUGGA 


5185 


«T»OOAOOAO OOOT 1 AOOT A OA A OOA fPOOOT>T>TiO 

ICCACCAG GGC1AGCTACAACGA TCCGTTTC 


6173 


2263 


OOOAOOTTO O TTOOAOOOO 

CGGAGCUG G UGGAGCCG 


5186 


e~*r>r\<~ymi~yr*i\ ooot»A oomTv otv 7\ ootv otv oomo/~»/~t 

CGGCTCCA GGCTAGCTACAACGA CAGCTCCG 


6174 


o o 


OTTOOTTOOA O OOOOTTOAO 

CUGGUGGA G CCGCUGAC 


5187 


/tp/""»tv nooo oo om7v oomTv nTi tv /n/^ tv m^"i/"»iv n/^Tv /*n 

GTCAGCGG GGCTAGCTACAACGA TCCACCAG 


6175 


O O *7 1 
ZZ / ± 


OT TOO A O OO O OTTO A OA OO 


5188 


ppmpmpi\p PPplPAPPTTl OA A OOA /^irp/^/~>7\ /-i 

GGIajICAG GGCiAGCIACAACGA GGCTCCAC 


6176 


2275 


AOOOOOTTO TV OTVOOTTTNOO 

AGCCGC U G A GAC C UAG C 


5189 


GCTAGGTG GGCTAGCTACAACGA CAGCGGCT 


6177 


2277 


CCGCUGAC A CCUAGCGG 


5190 


CCGCTAGG GGCTAGCTACAACGA GTCAGCGG 


6178 


2282 


GACACCUA G CGGAGCGA 


5191 


TCGCTCCG GGCTAGCTACAACGA TAGGTGTC 


6179 


2287 


CUAGCGGA G CGAUGCCC 


5192 


GGGCATCG GGCTAGCTACAACGA TCCGCTAG 


6180 


2290 


GCGGAGCG A UGCCCAAC 


5193 


GTTGGGCA GGCTAGCTACAACGA CGCTCCGC 


6181 


2292 


/T, ^» TV y^i /■* ■* f f /"I y**1/"1/"1TV *m ^*/-*TV 

GGAGCGAU G CCCAACCA 


5194 


TGGTTGGG GGCTAGCTACAACGA ATCGCTCC 


6182 


2297 


GAUGCCCA A CCAGGCGC 


5195 


GCGCCTGG GGCTAGCTACAACGA TGGGCATC 


6183 


2302 


CCAACCAG G CGCAGAUG 


5196 


CATCTGCG GGCTAGCTACAACGA CTGGTTGG 


6184 


2304 


AACCAGGC G CAGAUGCG 


5197 


CGCATCTG GGCTAGCTACAACGA GCCTGGTT 


6185 


2308 


AGGCGCAG A UGCGGAUC 


5198 


GATCCGCA GGCTAGCTACAACGA CTGCGCCT 


6186 


2310 


GCGCAGAU G CGGAUCCU 


5199 


AGGATCCG GGCTAGCTACAACGA ATCTGCGC 


6187 


2314 


AGAUGCGG A UCCUGAAA 


5200 


TTTCAGGA GGCTAGCTACAACGA CCGCATCT 


6188 


2326 


T TO 71 TV 71 f* 7\ O TV OOOTVOOTT/"t 

UGAAAGAG A CGGAGCUG 


5201 


CAGCTCCG GGCTAGCTACAACGA CTCTTTCA 


6189 


2331 


/■"« TV /"I TV *"ITV /"I rtTT/1»i1/*1» H 

GAGACGGA G CUGAGGAA 


5202 


TTCCTCAG GGCTAGCTACAACGA TCCGTCTC 


6190 


2341 


UGAGGAAG G UGAAGGUG 


5203 


CACCTTCA GGCTAGCTACAACGA CTTCCTCA 


6191 


2347 


TV /««rtf TV TV /~1 /—I TTy p t/"iT TT TVT /~t TV 

AGGUGAAG G UGCUUGGA 


5204 


TCCAAGCA GGCTAGCTACAACGA CTTCACCT 


6192 


2349 


GUGAAGGU G CUUGGAUC 


5205 


GATCCAAG GGCTAGCTACAACGA ACCTTCAC 


6193 


2355 


GUGCUUGG A UCUGGCGC 


5206 


GCGCCAGA GGCTAGCTACAACGA CCAAGCAC 


6194 


2360 


UGGAUCUG G CGCUUUUG 


5207 


CAAAAGCG GGCTAGCTACAACGA CAGATCCA 


6195 


2362 


GAUCUGGC G CUUUUGGC 


5208 


GCCAAAAG GGCTAGCTACAACGA GCCAGATC 


6196 


2369 


CGCUUUUG G CACAGUCU 


5209 


AGACTGTG GGCTAGCTACAACGA CAAAAGCG 


6197 


2371 


CUUUUGGC A CAGUCUAC 


5210 


GTAGACTG GGCTAGCTACAACGA GCCAAAAG 


6198 


2374 


UUGGCACA G UCUACAAG 


5211 


CTTGTAGA GGCTAGCTACAACGA TGTGCCAA 


6199 


2378 


CACAGUCU A CAAGGGCA 


5212 


TGCCCTTG GGCTAGCTACAACGA AGACTGTG 


6200 


2384 


CUACAAGG G CAUCUGGA 


5213 


TCCAGATG GGCTAGCTACAACGA CCTTGTAG 


6201 


2386 


ACAAGGGC A UCUGGAUC 


5214 


GATCCAGA GGCTAGCTACAACGA GCCCTTGT 


6202 


2392 


GCAUCUGG A UCCCUGAU 


5215 


ATCAGGGA GGCTAGCTACAACGA CCAGATGC 


6203 


2399 


GAUCCCUG A UGGGGAGA 


5216 


TCTCCCCA GGCTAGCTACAACGA CAGGGATC 


6204 


2408 


UGGGGAGA A UGUGAAAA 


5217 


TTTTCACA GGCTAGCTACAACGA TCTCCCCA 


6205 


2410 


GGGAGAAU G UGAAAAUU 


5218 


AATTTTCA GGCTAGCTACAACGA ATTCTCCC 


6206 


2416 


AUGUGAAA A UUCCAGUG 


5219 


CACTGGAA GGCTAGCTACAACGA TTTCACAT 


6207 


2422 


AAAUUCCA G UGGCCAUC 


5220 


GATGGCCA GGCTAGCTACAACGA TGGAATTT 


6208 


2425 


UUCCAGUG G CCAUCAAA 


5221 


TTTGATGG GGCTAGCTACAACGA CACTGGAA 


6209 


2428 


CAGUGGCC A UCAAAGUG 


5222 


CACTTTGA GGCTAGCTACAACGA GGCCACTG 


6210 
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2434 


CCAUCAAA G UGUUGAGG 


5223 


CCTCAACA GGCTAGCTACAACGA TTTGATGG 


6211 


2436 


AUCAAAGU G UUGAGGGA 


5224 


TCCCTCAA GGCTAGCTACAACGA ACTTTGAT 


6212 


2447 


GAGGGAAA A CACAUCCC 


5225 


GGGATGTG GGCTAGCTACAACGA TTTCCCTC 


6213 


2449 


GGGAAAAC A CAUCCCCC 


5226 


GGGGGATG GGCTAGCTACAACGA GTTTTCCC 


6214 


2451 


GAAAACAC A UCCCCCAA 


5227 


TTGGGGGA GGCTAGCTACAACGA GTGTTTTC 


6215 


2461 


CCCCCAAA G CCAACAAA 


5228 


TTTGTTGG GGCTAGCTACAACGA TTTGGGGG 


6216 


2465 


CAAAGCCA A CAAAGAAA 


5229 


TTTCTTTG GGCTAGCTACAACGA TGGCTTTG 


6217 


2473 


ACAAAGAA A UCUUAGAC 


5230 


GTCTAAGA GGCTAGCTACAACGA TTCTTTGT 


6218 


2480 


AAUCUUAG A CGAAGCAU 


5231 


ATGCTTCG GGCTAGCTACAACGA CTAAGATT 


6219 


2485 


UAGACGAA G CAUACGUG 


5232 


CACGTATG GGCTAGCTACAACGA TTCGTCTA 


6220 


2487 


GACGAAGC A UACGUGAU 


5233 


ATCACGTA GGCTAGCTACAACGA GCTTCGTC 


6221 


2489 


CGAAGCAU A CGUGAUGG 


5234. 


CCATCACG GGCTAGCTACAACGA ATGCTTCG 


6222 


2491 


AAGCAUAC G UGAUGGCU 


5235 


AGCCATCA GGCTAGCTACAACGA GTATGCTT 


6223 


2494 


CAUACGUG A UGGCUGGU 


5236 


ACCAGCCA GGCTAGCTACAACGA CACGTATG 


6224 


2497 


ACGUGAUG G CUGGUGUG 


5237 


CACACCAG GGCTAGCTACAACGA CATCACGT 


6225 


2501 


GAUGGCUG G UGUGGGCU 


5238 


AGCCCACA GGCTAGCTACAACGA CAGCCATC 


6226 


2503 


UGGCUGGU G UGGGCUCC 


5239 


GGAGCCCA GGCTAGCTACAACGA ACCAGCCA 


6227 


2507 


UGGUGUGG G CUCCCCAU 


5240 


ATGGGGAG GGCTAGCTACAACGA CCACACCA 


6228 


2514 


GGCUCCCC A UAUGUCUC 


5241 


GAGACATA GGCTAGCTACAACGA GGGGAGCC 


6229 


2516 


CUCCCCAU A UGUCUCCC 


5242 


GGGAGACA GGCTAGCTACAACGA ATGGGGAG 


6230 


2518 


CCCCAUAU G UCUCCCGC 


5243 


GCGGGAGA GGCTAGCTACAACGA ATATGGGG 


6231 


2525 


UGUCUCCC G CCUUCUGG 


5244 


CCAGAAGG GGCTAGCTACAACGA GGGAGACA 


6232 


2534 


CCUUCUGG G CAUCUGCC 


5245 


GGCAGATG GGCTAGCTACAACGA CCAGAAGG 


6233 


2536 


UUCUGGGC A UCUGCCUG 


5246 


CAGGCAGA GGCTAGCTACAACGA GCCCAGAA 


6234 


2540 


GGGCAUCU G CCUGACAU 


5247 


ATGTCAGG GGCTAGCTACAACGA AGATGCCC 


6235 


2545 


UCUGCCUG A CAUCCACG 


5248 


CGTGGATG GGCTAGCTACAACGA CAGGCAGA 


6236 


2547 


UGCCUGAC A UCCACGGU 


5249 


ACCGTGGA GGCTAGCTACAACGA GTCAGGCA 


6237 


2551 


UGACAUCC A CGGUGCAG 


5250 


CTGCACCG GGCTAGCTACAACGA GGATGTCA 


6238 


2554 


CAUCCACG G UGCAGCUG 


5251 


CAGCTGCA GGCTAGCTACAACGA CGTGGATG 


6239 


2556 


UCCACGGU G CAGCUGGU 


5252 


ACCAGCTG GGCTAGCTACAACGA ACCGTGGA 


6240 


2559 


ACGGUGCA G CUGGUGAC 


5253 


GTCACCAG GGCTAGCTACAACGA TGCACCGT 


6241 


2563 


UGCAGCUG G UGACACAG 


5254 


CTGTGTCA GGCTAGCTACAACGA CAGCTGCA 


6242 


2566 


AGCUGGUG A CACAGCUU 


5255 


AAGCTGTG GGCTAGCTACAACGA CACCAGCT 


6243 


2568 


CUGGUGAC A CAGCUUAU 


5256 


ATAAGCTG GGCTAGCTACAACGA GTCACCAG 


6244 


2571 


GUGACACA G CUUAUGCC 


5257 


GGCATAAG GGCTAGCTACAACGA TGTGTCAC 


6245 


2575 


CACAGCUU A UGCCCUAU 


5258 


ATAGGGCA GGCTAGCTACAACGA AAGCTGTG 


6246 


2577 


CAGCUUAU G CCCUAUGG 


5259 


CCATAGGG GGCTAGCTACAACGA ATAAGCTG 


6247 


2582 


UAUGCCCU A UGGCUGCC 


5260 


GGCAGCCA GGCTAGCTACAACGA AGGGCATA 


6248 


2585 


GCCCUAUG G CUGCCUCU 


5261 


AGAGGCAG GGCTAGCTACAACGA CATAGGGC 


6249 


2588 


CUAUGGCU G CCUCUUAG 


5262 


CTAAGAGG GGCTAGCTACAACGA AGCCATAG 


6250 


2597 


CCUCUUAG A CCAUGUCC 


5263 


GGACATGG GGCTAGCTACAACGA CTAAGAGG 


6251 


2600 


CUUAGACC A UGUCCGGG 


5264 


CCCGGACA GGCTAGCTACAACGA GGTCTAAG 


6252 


2602 


UAGACCAU G UCCGGGAA 


5265 


TTCCCGGA GGCTAGCTACAACGA ATGGTCTA 


6253 


2612 


CCGGGAAA A CCGCGGAC 


5266 


GTCCGCGG GGCTAGCTACAACGA TTTCCCGG 


6254 


2615 


GGAAAACC G CGGACGCC 


5267 


GGCGTCCG GGCTAGCTACAACGA GGTTTTCC 


6255 


2619 


AACCGCGG A CGCCUGGG 


5268 


CCCAGGCG GGCTAGCTACAACGA CCGCGGTT 


6256 


2621 


CCGCGGAC G CCUGGGCU 


5269 


AGCCCAGG GGCTAGCTACAACGA GTCCGCGG 


6257 


2627 


ACGCCUGG G CUCCCAGG 


5270 


CCTGGGAG GGCTAGCTACAACGA CCAGGCGT 


6258 


2636 


CUCCCAGG A CCUGCUGA 


5271 


TCAGCAGG GGCTAGCTACAACGA CCTGGGAG 


6259 


2640 


CAGGACCU G CUGAACUG 


5272 


CAGTTCAG GGCTAGCTACAACGA AGGTCCTG 


6260 


2645 


CCUGCUGA A CUGGUGUA 


5273 


TACACCAG GGCTAGCTACAACGA TCAGCAGG 


6261 


2649 


CUGAACUG G UGUAUGCA 


5274 


TGCATACA GGCTAGCTACAACGA CAGTTCAG 


6262 
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2651 


GAACUGGU G UAUGCAGA 


5275 


TCTGCATA GG CTAG CTACAACGA ACCAGTTC 


6263 


2653 


ACUGGUGU A UGCAGAUU 


5276 


AATCTGCA GG CTAG CTACAACGA ACACCAGT 


6264 


2655 


UGGUGUAU G CAGAUUGC 


5277 


GCAATCTG GG CTAGCTACAACGA ATACACCA 


6265 


2659 


GUAUGCAG A UUGCCAAG 


5278 


CTTGGCAA GG CTAGCTACAACGA CTGCATAC 


6266 


2662 


UGCAGAUU G CCAAGGGG 


5279 


CCCCTTGG GG CTAGCTACAACGA AATCTGCA 


6267 


2671 


CCAAGGGG A UGAGCUAC 


5280 


GTAGCTCA GGCTAG CTACAACGA CCCCTTGG 


6268 


2675 


GGGGAUGA G CUACCUGG 


5281 


CCAGGTAG GGCTAG CTACAACGA TCATCCCC 


6269 


2678 


GAUGAGCU A CCUGGAGG 


5282 


CCTCCAGG GGCTAG CTACAACGA AGCTCATC 


6270 


2687 


CCUGGAGG A UGUGCGGC 


5283 


GCCGCACA GG CTAGCTACAACGA CCTCCAGG 


6271 


2689 


UGGAGGAU G UGCGGCUC 


5284 


GAGCCGCA GGCTAGCTACAACGA ATCCTCCA 


6272 


2691 


GAGGAUGU G CGGCUCGU 


5285 


ACGAGCCG GGCTAGCTACAACGA ACATCCTC 


6273 


2694 


GAUGUGCG G CUCGUACA 


5286 


TGTACGAG GGCTAGCTACAACGA CGCACATC 


6274 


2698 


UGCGGCUC G UACACAGG 


5287 


CCTGTGTA GGCTAGCTACAACGA GAGCCGCA 


6275 


2700 


CGGCUCGU A CACAGGGA 


5288 


TCCCTGTG GGCTAGCTACAACGA ACGAGCCG 


6276 


2702 


GCUCGUAC A CAGGGACU 


5289 


AGTCCCTG GGCTAGCTACAACGA GTACGAGC 


6277 


2708 


ACACAGGG A CUUGGCCG 


5290 


CGGCCAAG GGCTAGCTACAACGA CCCTGTGT 


6278 


2713 


GGGACUUG G CCGCUCGG 


5291 


CCGAGCGG GGCTAGCTACAACGA CAAGTCCC 


6279 


2716 


ACUUGGCC G CUCGGAAC 


5292 


GTTCCGAG GGCTAGCTACAACGA GGCCAAGT 


6280 


2723 


CGCUCGGA A CGUGCUGG 


5293 


CCAGCACG GGCTAGCTACAACGA TCCGAGCG 


6281 


2725 


CUCGGAAC G UGCUGGVC 


5294 


GACCAGCA GGCTAGCTACAACGA GTTCCGAG 


6282 


2727 


CGGAACGU G CUGGUCAA 


5295 


TTGACCAG GGCTAGCTACAACGA ACGTTCCG 


6283 


2731 


ACGUGCUG G UCAAGAGU 


5296 


ACTCTTGA GGCTAGCTACAACGA CAGCACGT 


6284 


2738 


GGUCAAGA G UCCCAACC 


5297 


GGTTGGGA GGCTAGCTACAACGA TCTTGACC 


6285 


2744 


GAGUCCCA A CCAUGUCA 


5298 


TGACATGG GGCTAGCTACAACGA TGGGACTC 


6286 


2747 


UCCCAACC A UGUCAAAA 


5299 


TTTTGACA GGCTAGCTACAACGA GGTTGGGA 


6287 


2749 


CCAACCAU G UCAAAAUU 


5300 


AATTTTGA GGCTAGCTACAACGA ATGGTTGG 


6288 


2755 


AUGUCAAA A UUACAGAC 


5301 


GTCTGTAA GGCTAGCTACAACGA TTTGACAT 


6289 


2758 


UCAAAAUU A CAGACUUC 


5302 


GAAGTCTG GGCTAGCTACAACGA AATTTTGA 


6290 


2762 


AAUUACAG A CUUCGGGC 


5303 


GCCCGAAG GGCTAGCTACAACGA CTGTAATT 


6291 


2769 


GACUUCGG G CUGGCUCG 


5304 


CGAGCCAG GGCTAGCTACAACGA CCGAAGTC 


6292 


2773 


UCGGGCUG G CUCGGCUG 


5305 


CAGCCGAG GGCTAGCTACAACGA CAGCCCGA 


6293 


2778 


CUGGCUCG G CUGCUGGA 


5306 


TCCAGCAG GGCTAGCTACAACGA CGAGCCAG 


6294 


2781 


GCUCGGCU G CUGGACAU 


5307 


ATGTCCAG GGCTAGCTACAACGA AGCCGAGC 


6295 


2786 


GCUGCUGG A CAUUGACG 


5308 


CGTCAATG GGCTAGCTACAACGA CCAGCAGC 


6296 


2788 


UGCUGGAC A UUGACGAG 


5309 


CTCGTCAA GGCTAGCTACAACGA GTCCAGCA 


6297 


2792 


GGACAUUG A CGAGACAG 


5310 


CTGTCTCG GGCTAGCTACAACGA CAATGTCC 


6298 


2797 


UUGACGAG A CAGAGUAC 


5311 


GTACTCTG GGCTAGCTACAACGA CTCGTCAA 


6299 


2802 


GAGACAGA G UACCAUGC 


5312 


GCATGGTA GGCTAGCTACAACGA TCTGTCTC 


6300 


2804 


GACAGAGU A CCAUGCAG 


5313 


CTGCATGG GGCTAGCTACAACGA ACTCTGTC 


6301 


2807 


AGAGUACC A UGCAGAUG 


5314 


CATCTGCA GGCTAGCTACAACGA GGTACTCT 


6302 


2809 


AGUACCAU G CAGAUGGG 


5315 


CCCATCTG GGCTAGCTACAACGA ATGGTACT 


6303 


2813 


CCAUGCAG A UGGGGGCA 


5316 


TGCCCCCA GGCTAGCTACAACGA CTGCATGG 


6304 


2819 


AGAUGGGG G CAAGGUGC 


5317 


GCACCTTG GGCTAGCTACAACGA CCCCATCT 


6305 


2824 


GGGGCAAG G UGCCCAUC 


5318 


GATGGGCA GGCTAGCTACAACGA CTTGCCCC 


6306 


2826 


GGCAAGGU G CCCAUCAA 


5319 


TTGATGGG GGCTAGCTACAACGA ACCTTGCC 


6307 


2830 


AGGUGCCC A UCAAGUGG 


5320 


CCACTTGA GGCTAGCTACAACGA GGGCACCT 


6308 


2835 


CCCAUCAA G UGGAUGGC 


5321 


GCCATCCA GGCTAGCTACAACGA TTGATGGG 


6309 


2839 


UCAAGUGG A UGGCGCUG 


5322 


CAGCGCCA GGCTAGCTACAACGA CCACTTGA 


6310 


2842 


AGUGGAUG G CGCUGGAG 


5323 


CTCCAGCG GGCTAGCTACAACGA CATCCACT 


6311 


2844 


UGGAUGGC G CUGGAGUC 


5324 


GACTCCAG GGCTAGCTACAACGA GCCATCCA 


6312 


2850 


GCGCUGGA G UCCAUUCU 


5325 


AGAATGGA GGCTAGCTACAACGA TCCAGCGC 


6313 


2854 


UGGAGUCC A UUCUCCGC 


5326 


GCGGAGAA GGCTAGCTACAACGA GGACTCCA 


6314 
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2861 


CAUUCUCC G CCGGCGGU 


; 5327 


ACCGCCGG GGCTAGCTACAACGA GGAGAATG 


6315 


2865 


CVCCGCCG G CGGUUCAC 


5328 


GTGAACCG GGCTAGCTACAACGA CGGCGGAG 


6316 


2868 


CGCCGGCG G UUCACCCA 


5329 


TGGGTGAA GGCTAGCTACAACGA CGCCGGCG 


6317 


2872 


GGCGGUUC A CCCACCAG 


5330 


CTGGTGGG GGCTAGCTACAACGA GAACCGCC 


6318 


2876 


GUUCACCC A CCAGAGUG 


5331 


CACTCTGG GGCTAGCTACAACGA GGGTGAAC 


6319 


2882 


CCACCAGA G UGAUGUGU 


5332 


ACACATCA GGCTAGCTACAACGA TCTGGTGG 


6320 


2885 


CCAGAGUG A UGUGUGGA 


5333 


TCCACACA GGCTAGCTACAACGA CACTCTGG 


6321 


2887 


AGAGUGAU G UGUGGAGU 


5334 


ACTCCACA GGCTAGCTACAACGA ATCACTCT 


6322 


2889 


AGUGAUGU G UGGAGUUA 


5335 


TAACTCCA GGCTAGCTACAACGA ACATCACT 


6323 


2894 


UGUGUGGA G UUAUGGUG 


5336 


CACCATAA GGCTAGCTACAACGA TCCACACA 


6324 


2897 


GUGGAGUU A UGGUGUGA 


5337 


TCACACCA GGCTAGCTACAACGA AACTCCAC 


6325 


2900 


GAGUUAUG G UGUGACUG 


5338 


CAGTCACA GGCTAGCTACAACGA CATAACTC 


6326 


2902 


GUUAUGGU G UGACUGUG 


5339 


CACAGTCA GGCTAGCTACAACGA ACCATAAC 


6327 


2905 


AUGGUGUG A CUGUGUGG 


5340 


CCACACAG GGCTAGCTACAACGA CACACCAT 


6328 


2908 


GUGUGACU G UGUGGGAG 


5341 


CTCCCACA GGCTAGCTACAACGA AGTCACAC 


6329 


2910 


GUGACUGU G UGGGAGCU 


5342 


AGCTCCCA GGCTAGCTACAACGA ACAGTCAC 


6330 


2916 


GUGUGGGA G CUGAUGAC 


5343 


GTCATCAG GGCTAGCTACAACGA TCCCACAC 


6331 


2920 


GGGAGCUG A UGACUUUU 


5344 


AAAAGTCA GGCTAGCTACAACGA CAGCTCCC 


6332 


2923 


AGCUGAUG A CUUUUGGG 


5345 


CCCAAAAG GGCTAGCTACAACGA CATCAGCT 


6333 


2932 


CUUUUGGG G CCAAACCU 


5346 


AGGTTTGG GGCTAGCTACAACGA CCCAAAAG 


6334 


2937 


GGGGCCAA A CCUUACGA 


5347 


TCGTAAGG GGCTAGCTACAACGA TTGGCCCC 


6335 


2942 


CAAACCUU A CGAUGGGA 


5348 


TCCCATCG GGCTAGCTACAACGA AAGGTTTG 


6336 


2945 


ACCUUACG A UGGGAUCC 


5349 


GGATCCCA GGCTAGCTACAACGA CGTAAGGT 


6337 


2950 


ACGAUGGG A UCCCAGCC 


5350 


GGCTGGGA GGCTAGCTACAACGA CCCATCGT 


6338 


2956 


GGAUCCCA G CCCGGGAG 


5351 


CTCCCGGG GGCTAGCTACAACGA TGGGATCC 


6339 


2965 


CCCGGGAG A UCCCUGAC 


5352 


GTCAGGGA GGCTAGCTACAACGA CTCCCGGG 


6340 


2972 


GAUCCCUG A CCUGCUGG 


5353 


CCAGCAGG GGCTAGCTACAACGA CAGGGATC 


6341 


2976 


CCUGACCU G CUGGAAAA 


5354 


TTTTCCAG GGCTAGCTACAACGA AGGTCAGG 


6342 


2991 


AAGGGGGA G CGGCUGCC 


5355 


GGCAGCCG GGCTAGCTACAACGA TCCCCCTT 


6343 


2994 


GGGGAGCG G CUGCCCCA 


5356 


TGGGGCAG GGCTAGCTACAACGA CGCTCCCC 


6344 


2997 


GAGCGGCU G CCCCAGCC 


5357 


GGCTGGGG GGCTAGCTACAACGA AGCCGCTC 


6345 


3003 


CUGCCCCA G CCCCCCAU 


5358 


ATGGGGGG GGCTAGCTACAACGA TGGGGCAG 


6346 


3010 


AGCCCCCC A UCUGCACC 


5359 


GGTGCAGA GGCTAGCTACAACGA GGGGGGCT 


6347 


3014 


CCCCAUCU G CACCAUUG 


5360 


CAATGGTG GGCTAGCTACAACGA AGATGGGG 


6348 


3016 


CCAUCUGC A CCAUUGAU 


5361 


ATCAATGG GGCTAGCTACAACGA GCAGATGG 


6349 


3019 


UCUGCACC A UUGAUGUC 


5362 


GACATCAA GGCTAGCTACAACGA GGTGCAGA 


6350 


3023 


CACCAUUG A UGUCUACA 


5363 


TGTAGACA GGCTAGCTACAACGA CAATGGTG 


6351 


3025 


CCAUUGAU G UCUACAUG 


5364 


CATGTAGA GGCTAGCTACAACGA ATCAATGG 


6352 


3029 


UGAUGUCU A CAUGAUCA 


5365 


TGATCATG GGCTAGCTACAACGA AGACATCA 


6353 


3031 


AUGUCUAC A UGAUCAUG 


5366 


CATGATCA GGCTAGCTACAACGA GTAGACAT 


6354 


3034 


UCUACAUG A UCAUGGUC 


5367 


GACCATGA GGCTAGCTACAACGA CATGTAGA 


6355 


3037 


ACAUGAUC A UGGUCAAA [ 


5368 


TTTGACCA GGCTAGCTACAACGA GATCATGT 


6356 


3040 


UGAUCAUG G UCAAAUGU 


5369 


ACATTTGA GGCTAGCTACAACGA CATGATCA 


6357 


3045 


AUGGUCAA A UGUUGGAU 


5370 


ATCCAACA GGCTAGCTACAACGA TTGACCAT 


6358 


3047 


GGUCAAAU G UUGGAUGA 


5371 


TCATCCAA GGCTAGCTACAACGA ATTTGACC 


6359 


3052 


AAUGUUGG A UGAUUGAC 


5372 


GTCAATCA GGCTAGCTACAACGA CCAACATT 


6360 


3055 


GUUGGAUG A UUGACUCU 


5373 


AGAGTCAA GGCTAGCTACAACGA CATCCAAC 


6361 


3059 


GAUGAUUG A CUCUGAAU 


5374 


ATTCAGAG GGCTAGCTACAACGA CAATCATC 


6362 


3066 


GACUCUGA A UGUCGGCC 


5375 


GGCCGACA GGCTAGCTACAACGA TCAGAGTC 


6363 


3068 


CUCUGAAU G UCGGCCAA ! 


5376 


TTGGCCGA GGCTAGCTACAACGA ATTCAGAG 


6364 


3072 


GAAUGUCG G CCAAGAUU 


5377 


AATCTTGG GGCTAGCTACAACGA CGACATTC 


6365 


3078 


CGGCCAAG A UUCCGGGA 


5378 


TCCCGGAA GGCTAGCTACAACGA CTTGGCCG 


6366 
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3087 


TTTTOOOOOA /~* T TT THHT TOT T/"1 

UUCCGGGA G UUGGUGUC 


5379 


OAOTvOOAA OOOfTlA OOFT1TV /-1TV TV /~tO A mOOOOO TV TV 

GACACCAA GGCTAGCTACAACGA TCCCGGAA 


6367 




ooo aotttto o t 707 to7 7P a a 
bbuAbUUb G UGUCUGAA 


5380 


mm /-t TV O A O TV OOOT' A O OTiTV /""VTV TV /TO TV OA TV Pmn/in 

1 1 CAGACA GGL1AGLIACAACGA CAACTCCC 


6368 




/~i 7\ /"IT TT 70 O T T O 7 T OT 70 7\ 7\ T TT 7 

GAGUUGGU G UCUGAAUU 


5381 


ATvmmoAOA oo orpA oorr»A /r a a /"vnv a oo a a /im/i 

AATTCAGA GGCTAGCTACAACGA ACCAACTC 


6369 


1 A O Q 

JU99 


/—IT TOT T/*1T T/1 7\ 7\ TTTTOTT/^/^rin 

GUGUCUGA A UUCUCCCG 


5382 


CGGGAGAA GGCTAGCTACAACGA TCAGACAC 


6370 


3 i.U / 


7\ 7 77 TOT TO OO O P7\TTPPPPA 

AUUCUCCC G CAUGGCCA 


5383 


TGGCCATG GGCTAGCTACAACGA GGGAGAAT 


6371 


3109 


UCUCCCGC A UGGCCAGG 


5384 


P/"imr»n/iyni» PnPmw Pnm» ni\ » Pnii nppnrtumk 

CCTGGCCA GGCTAGCTACAACGA GCGGGAGA 


6372 


•31 1 <5 


PPPPPAT TO O 007X0007X0 

LLLoUAUo G CCAGGGAC 


5385 


pmp pprnp O OO OT" A OOm A O A A OO A O A TV»00/"T^"» 

G1CCCTGG GGC1 AGCTACAACGA CATGCGGG 


6373 


11 1 Q 


LiGCCAGGG A LLLLuibL 


5386 


GCTGGGGG GGCTAGCTACAACGA CCCTGGCC 


6374 




07\ PPPPPT\ O OO OT 77 T7 70T T 

GACLCLLA G LGCUUUGU 


5387 


ACAAAGCG GGCTAGCTACAACGA TGGGGGTC 


6375 


"51 OQ 


ccr^r^c^T^r*^ o or T7 tt to t 70 0 
1-L.CCCAGC v CUUULjUGG 


5388 


OOA OA A AO PPPHTl 0/™«TiA /TA A /~1/~» A /~t nno f~t/~M~*f^ 

CCACAAAG GGCTAGCTACAACGA GCTGGGGG 


6376 




AOrTTTTTTTT O TTf"2PTTP A tto 


5389 


OAT'OAOOA OOOTIA OOTIA OA A OO A A A A OOO / HI! 

GAIGACLA GGLI AGCTACAACGA AAAGCGCT 


6377 


J 1 JO 


O OI 77 77 70 T 70 O TTO7\7700 A O 

GCUUUGUG G UCAULCAG 


5390 


OT^OOATOA OOOT'AOOT'A OA A OO A OA OA A A OO 

CTGGATGA GGCTAGCTACAACGA CACAAAGC 


6378 


1 1 Q 


TTT TOT TOOT TO A TTOOAPAATT 


5391 


A n^T^OT^OO A O PT A O OTT A O A A OO A OAOOAOTVA 

ATTCTGGA GGCTAGCTACAACGA GACCACAA 


6379 




OAT TOO AO A A 7 70 A OO A OTT 


5392 


AOrpOOTtOA PPPWH O /TITl A /""« A A /^/~t A iDniDnOltmn 

AGTCCTCA GGCTAGCTACAACGA TCTGGATG 


6380 


■ji o 


piv ATTOA/T 1 A TIT/* 1 PPPP 

GAAUGAGG A GUUGGGGG 


5393 


oooooaao 000mA oomA m iv /^/^ a /inwo t\ >nrn/-i 

GGCCCAAG GGCTAGCTACAACGA CCTCATTC 


6381 


iloo 


00 a 0777 too 0 r~T i f~ , T\f~'r* i r*~T\. 
GGAGUUGG G CCCAGCCA 


5394 


nr»/^/^/T»Tn/T/~i/T r<nrnrnvo/Hfm\ n* t» o/m nnit nnrn/-i/-i 

TGGCTGGG GGCTAGCTACAACGA CCAAGTCC 


6382 


n CI 


TTOOOOOOA O PPAPTTPPP 


5395 


pritpo O O OTi A O nm TV /"ttv TV OO A rnnPi^nnyiiv 

GGGAC1GG GGC1 AGCTACAACGA TGGGCCCA 


6383 


J JL O / 


r>r % r , i\r t r , r*T\ 0 t tooot tt too 
V-L.LAGGGA UGGGUUGG 


5396 


OOAAOOOA OOOHTA OOTi A OTV A OO A mflflPmnn/^ 

CCAAGGGA GGCTAGCTACAACGA TGGCTGGG 


6384 


11 ic 


T tooot tt too a oaooaoott 
UCCCUUGG A CAGCACCU 


5397 


AGGTGCTG GGCTAGCTACAACGA CCAAGGGA 


6385 


1 1 1 Q 


OTTTTOOAOA O OA OO! 77 TOT T 

LUUGGAGA G LACCUUCU 


5398 


AGAAGGTG GGCTAGCTACAACGA TGTCCAAG 


6386 


3181 


TT^ivnunn a oottitotttvo 

UGGACAGC A CCUUCUAC 


5399 


GTAGAAGG GGCTAGCTACAACGA GCTGTCCA 


6387 


3 188 


OAOOTTTTOTT A OOOOTTOTV/" 1 ! 

CACCUUCU A CCGCUCAC 


5400 


GTGAGCGG GGCTAGCTACAACGA AGAAGGTG 


6388 


3191 


Of TTTOTTA OO O OTTOAOTTOO 

CUUCUALC G CUCACUGC 


5401 


GCAGTGAG GGCTAGCTACAACGA GGTAGAAG 


6389 


3195 


UACCGCUC A CUGCUGGA 


5402 


TCCAGCAG GGCTAGCTACAACGA GAGCGGTA 


6390 


3198 


0/"T/"1TT/~nv /TTT /*1 OTTrtnTirirtll 

CGCUCACU G CUGGAGGA 


5403 


TCCTCCAG GGCTAGCTACAACGA AGTGAGCG 


6391 


3206 


GCUGGAGG A CGAUGACA 


5404 


TGTCATCG GGCTAGCTACAACGA CCTCCAGC 


6392 


1 1 A O 


GGAGGACG A UGACAUGG 


5405 


CCATGTCA GGCTAGCTACAACGA CGTCCTCC 


6393 


3212 


GGACGAUG A CAUGGGGG 


5406 


CCCCCATG GGCTAGCTACAACGA CATCGTCC 


6394 


3214 


ACGAUGAC A UGGGGGAC 


5407 


GTCCCCCA GGCTAGCTACAACGA GTCATCGT 


6395 


3221 


PnTTnn/>pn i\ onTT/^ntT/*in 

CAUGGGGG A CCUGGUGG 


5408 


CCACCAGG GGCTAGCTACAACGA CCCCCATG 


6396 


in/; 


OOO A OOI TO O T TO OAT TO OT T 

GGGACCUG G UGGAUGCU 


5409 


AGCATCCA GGCTAGCTACAACGA CAGGTCCC 


6397 


inn 


OOI TOOT TOO A TTOOTTOAOO 

LCUGGUGG A UGCUGAGG 


5410 


CCTCAGCA GGCTAGCTACAACGA CCACCAGG 


6398 


iiii 
J ^ J 


TTOOTTOOATT O OT TO A OO TV /T 

UGGUGGAU G CUGAGGAG 


5411 


CTCCTCAG GGCTAGCTACAACGA ATCCACCA 


6399 




POTTO A OO A O TTATTOTTOOTT 

GLUGAGGA G UAUGUGGU 


5412 


ACCAGATA GGCTAGCTACAACGA TCCTCAGC 


6400 


3242 


UGAGGAGU A UCUGGUAC 


5413 


GTACCAGA GGCTAGCTACAACGA ACTCCTCA 


6401 


3247 


AGUAUCUG G UACCCCAG 


5414 


CTGGGGTA GGCTAGCTACAACGA CAGATACT 


6402 


3249 


UAUCUGGU A CCCCAGCA 


5415 


TGCTGGGG GGCTAGCTACAACGA ACCAGATA 


6403 


3255 


GUACCCCA G CAGGGCUU 


5416 


AAGCCCTG GGCTAGCTACAACGA TGGGGTAC 


6404 


3260 


CCAGCAGG G CUUCUUCU 


5417 


AGAAGAAG GGCTAGCTACAACGA CCTGCTGG 


6405 


3269 


CUUCUUCU G UCCAGACC 


5418 


GGTCTGGA GGCTAGCTACAACGA AGAAGAAG 


6406 


3275 


CUGUCCAG A CCCUGCCC 


5419 


GGGCAGGG GGCTAGCTACAACGA CTGGACAG 


6407 


3280 


CAGACCCU G CCCCGGGC 


5420 


GCCCGGGG GGCTAGCTACAACGA AGGGTCTG 


6408 


3287 


UGCCCCGG G CGCUGGGG 


5421 


CCCCAGCG GGCTAGCTACAACGA CCGGGGCA 


6409 


3289 


CCCCGGGC G CUGGGGGC 


5422 


GCCCCCAG GGCTAGCTACAACGA GCCCGGGG 


6410 


3296 


CGCUGGGG G CAUGGUCC 


5423 


GGACCATG GGCTAGCTACAACGA CCCCAGCG 


6411 


3298 


CUGGGGGC A UGGUCCAC 


5424 


GTGGACCA GGCTAGCTACAACGA GCCCCCAG 


6412 


3301 


GGGGCAUG G UCCACCAC 


5425 


GTGGTGGA GGCTAGCTACAACGA CATGCCCC 


6413 


3305 


CAUGGUCC A CCACAGGC 


5426 


GCCTGTGG GGCTAGCTACAACGA GGACCATG 


6414 


3308 


GGUCCACC A CAGGCACC 


5427 


GGTGCCTG GGCTAGCTACAACGA GGTGGACC 


6415 


3312 


CACCACAG G CACCGCAG 


5428 


CTGCGGTG GGCTAGCTACAACGA CTGTGGTG 


6416 


3314 


CCACAGGC A CCGCAGCU 


5429 


AGCTGCGG GGCTAGCTACAACGA GCCTGTGG 


6417 


3317 


CAGGCACC G CAGCUCAU 


5430 


ATGAGCTG GGCTAGCTACAACGA GGTGCCTG 


6418 
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3320 


GCACCGCA G CUCAUCUA 


5431 


rTTTV /-l T\ TV /T /■> f*% /"1fT\T\ /T /"trrt TV /-? TV TV /T/T TV m/t /"imn /"t 

TAGATGAG GGCTAGCTACAACGA TGCGGTGC 


6419 


3 324 


CGCAGCUC A UCUACCAG 


5432 


nmnprnnnTv PPPTHPPTT\PMvnni\ pi\ppitippp 

CTGGTAGA GGCTAGCTACAACGA GAGCTGCG 


6420 


3328 


GCUCAUCU A CCAGGAGU 


5433 


APT*PP»T»PP PPPHPAPPTIA P7l TV PP T\ TV P TV T»P 7\ P P 

ACTCCTGG GGCTAGCTACAACGA AGATGAGC 


6421 


O O "3 C 


T TT\ PP A PP TV P T TPP pppT TP 

UACCAGGA G UGGCGGUG 


5434 


/~>t\ ripr>PriT\ pppt>7v ppt* a pa t\ pp t\ nripPTtpprrtTi 

CACCGCCA GGC J.AGCJ.ACAACGA TCCTGGTA 


6422 


3338 


CAGGAGUG G CGGUGGGG 


5435 


/■J/T/T^'ITA ^T /"1/~T /1/"1/*trm\ 1 1 TV /*1 TV TV /"1/*1TA TV / tf 1 ) f~%/^\ FTT/T 

CCCCACCG GGCTAGCTACAACGA CACTCCTG 


6423 




GAG UGGGG G UGGGGACC 


5436 


p'p» r pr' , p , pip»A pppt<a ppt 1 a p a 7\ pp a r>fTT>7\ ptp 
GGrCCCCA GGCTAGCTACAACGA CGCCACIC 


6424 


Jj4 / 


GGGUGGGG A CCUGACAC 


5437 


PTPI'PAPP PPPT A PPT 1 A P A 7\ PP 7V PPIPV^A PPP 

GTGTCAGG GGCTAGCTACAACGA CCCCACCG 


6425 




PPP7\PPITP A P 1 TV riT mppp 

GvjvjAC C U G A GAG U ALjVj<jt 


5438 


ppprpTvprpp PPPTAPPTAPA 7V PPA PTVPPTP'PP' 

CCClAvalG GGC1AGC1ACAACGA CAGGICCC 


6426 


3354 


p 71 npr Tn TV p TV PTIAPPPPTT 

GALL UGAL. A G L/AGGGG U 


5439 


APPPPT7VP PP PTI7V PPT7V P A 7k PP7V PT»PTkPPrpp 

AGCCC1AG GGCIAGC1ACAACGA GICAGG1C 


6427 


J JbU 


tv pa pnnpp /™» piTPPTi pnp 
AGAG U AGG G G U GGAGG G 


5440 


pp / m i ipp 7\ p ppprpA pipirnTi PA A PP A PPT TV PTPT 

GGLrrCCAG GGCTAGCTACAACGA CCTAGTGT 


6428 


"3 "3 C C 

J Job 


PPPPITPPA Pi PPPTTPT TP 7V 

GGGGUGGA G GCGUGUGA 


5441 


T»PAPAPPP pp PT A p prpTV PTV TV PP TV TPPnPPPP 

TCAGAGGG GGCTAGCTACAACGA TCCAGCCC 


6429 




TV A P A PP A P P PPPPPAPT*" 

AAGAGGAG G GGGGGAGG 


5442 


p/ ii i inppnri ppprPTv ppvttiv ptv tv pp tv prnpp/npfPfn 

CCTGGGGG GGCTAGCTACAACGA CTCCTCTT 


6430 


3390 


ppprtppTi /-I p T TPTTPP A PI T 

GGGGGGAG G UGUGGAGU 


5443 


7V prpppTi p TV PPPT A p PT A PA 7k PP 7V /"""TVT"'PPP P 

AGTGGAGA GGCTAGCTACAACGA CTGGGGGC 


6431 


3396 


7V PPT TPT TPP A PTTPPPAPP 

AGGUCUCC A CUGGCACC 


5444 


PPTPPP7VP P P PfTl 7k P PIT* TV P TV IV PP TV PPHPAPPW 

GGTGCCAG GGCTAGCTACAACGA GGAGACCT 


6432 


3400 


PT TPPTV PT TP P Ptv nnPTTPP 

CUCCACUG G CACCCUCC 


5445 


GGAGGGTG GGCTAGCTACAACGA CAGTGGAG 


6433 


3402 


CCACUGGC A CCCUCCGA 


5446 


TCGGAGGG GGCTAGCTACAACGA GCCAGTGG 


6434 


3415 


CCGAAGGG G CUGGCUCC 


5447 


GGAGCCAG GGCTAGCTACAACGA CCCTTCGG 


6435 


3419 


AGGGGCUG G CUCCGAUG 


5448 


CATCGGAG GGCTAGCTACAACGA CAGCCCCT 


6436 


3425 


UGGCUCCG A UGUAUUUG 


5449 


CAAATACA GGCTAGCTACAACGA CGGAGCCA 


6437 


3427 


GCUCCGAU G UAUUUGAU 


5450 


ATCAAATA GGCTAGCTACAACGA ATCGGAGC 


6438 


3429 


UCCGAUGU A UUUGAUGG 


5451 


CCATCAAA GGCTAGCTACAACGA ACATCGGA 


6439 


3434 


UGUAUUUG A UGGUGACC 


5452 


GGTCACCA GGCTAGCTACAACGA CAAATACA 


6440 


3437 


AUUUGAUG G UGACCUGG 


5453 


CCAGGTCA GGCTAGCTACAACGA CATCAAAT 


6441 


3440 


UGAUGGUG A CCUGGGAA 


5454 


TTCCCAGG GGCTAGCTACAACGA CACCATCA 


6442 


3448 


ACCUGGGA A UGGGGGCA 


5455 


TGCCCCCA GGCTAGCTACAACGA TCCCAGGT 


6443 


3454 


GAAUGGGG G CAGCCAAG 


5456 


CTTGGCTG GGCTAGCTACAACGA CCCCATTC 


6444 


3457 


UGGGGGCA G CCAAGGGG 


5457 


CCCCTTGG GGCTAGCTACAACGA TGCCCCCA 


6445 


3 4 65 


GCCAAGGG G CUGCAAAG 


5458 


CTTTGCAG GGCTAGCTACAACGA CCCTTGGC 


6446 


3468 


TV Tv /~i /i /*i /*rr t /""l /^i tv tv tv /**^t t 

AAGGGGCU G CAAAGCCU 


5459 


AGGCTTTG GGCTAGCTACAACGA AGCCCCTT 


6447 


3473 


GCUGCAAA G CCUCCCCA 


5460 


TGGGGAGG GGCTAGCTACAACGA TTTGCAGC 


6448 


3481 


GCCUCCCC A CACAUGAC 


5461 


GTCATGTG GGCTAGCTACAACGA GGGGAGGC 


6449 


3483 


CUCCCCAC A CAUGACCC 


5462 


GGGTCATG GGCTAGCTACAACGA GTGGGGAG 


6450 


3485 


^/1/~1/-n\ /T. TV TV T T/~ t TV /"I /"l ^T, /""» TV 

CCCCACAC A UGACCCCA 


5463 


TGGGGTCA GGCTAGCTACAACGA GTGTGGGG 


6451 


3488 


/"1T\ /"I TV ^1 TV ftn TV j*1i^^T/TT\ /~\ /*T/*» 

CACACAUG A CCCCAGCC 


5464 


GGCTGGGG GGCTAGCTACAACGA CATGTGTG 


6452 


3494 


UGACCCCA G CCCUCUAC 


5465 


GTAGAGGG GGCTAGCTACAACGA TGGGGTCA 


6453 


3501 


AGCCCUCU A CAGCGGUA 


5466 


TACCGCTG GGCTAGCTACAACGA AGAGGGCT 


6454 


3504 


CCUCUACA G CGGUACAG 


5467 


CTGTACCG GGCTAGCTACAACGA TGTAGAGG 


6455 


3507 


CUACAGCG G UACAGUGA 


5468 


TCACTGTA GGCTAGCTACAACGA CGCTGTAG 


6456 


3509 


ACAGCGGU A CAGUGAGG 


5469 


CCTCACTG GGCTAGCTACAACGA ACCGCTGT 


6457 


3512 


GCGGUACA G UGAGGACC 


5470 


GGTCCTCA GGCTAGCTACAACGA TGTACCGC 


6458 


3518 


CAGUGAGG A CCCCACAG 


5471 


CTGTGGGG GGCTAGCTACAACGA CCTCACTG 


6459 


3523 


AGGACCCC A CAGUACCC 


5472 


GGGTACTG GGCTAGCTACAACGA GGGGTCCT 


6460 


3526 


ACCCCACA G UACCCCUG 


5473 


CAGGGGTA GGCTAGCTACAACGA TGTGGGGT 


6461 


3528 


CCCACAGU A CCCCUGCC 


5474 


GGCAGGGG GGCTAGCTACAACGA ACTGTGGG 


6462 


3534 


GUACCCCU G CCCUCUGA 


5475 


TCAGAGGG GGCTAGCTACAACGA AGGGGTAC 


6463 


3544 


CCUCUGAG A CUGAUGGC 


5476 


GCCATCAG GGCTAGCTACAACGA CTCAGAGG 


6464 


3548 


UGAGACUG A UGGCUACG 


5477 


CGTAGCCA GGCTAGCTACAACGA CAGTCTCA 


6465 


3551 


GACUGAUG G CUACGUUG 


5478 


CAACGTAG GGCTAGCTACAACGA CATCAGTC 


6466 


3554 


UGAUGGCU A CGUUGCCC 


5479 


GGGCAACG GGCTAGCTACAACGA AGCCATCA 


6467 


3556 


AUGGCUAC G UUGCCCCC 


5480 


GGGGGCAA GGCTAGCTACAACGA GTAGCCAT 


6468 


3559 


GCUACGUU G CCCCCCUG 


5481 


CAGGGGGG GGCTAGCTACAACGA AACGTAGC 


6469 


3568 


CCCCCCUG A CCUGCAGC 


5482 


GCTGCAGG GGCTAGCTACAACGA CAGGGGGG 


6470 
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3572 


CCUGACCU G CAGCCCCC 


5483 


GGGGGCTG GGCTAGCTACAACGA AGGTCAGG 


6471 


3575 


GACCUGCA G CCCCCAGC 


5484 


GCTGGGGG GGCTAGCTACAACGA TGCAGGTC 


6472 


3582 


AGCCCCCA G CCUGAAUA 


5485 


TATTCAGG GGCTAGCTACAACGA TGGGGGCT 


6473 


3588 


CAGCCUGA A UAUGUGAA 


5486 


TTCACATA GGCTAGCTACAACGA TCAGGCTG 


6474 


3590 


GCCUGAAU A UGUGAACC 


5487 


GGTTCACA GGCTAGCTACAACGA ATTCAGGC 


6475 


3592 


CUGAAUAU G UGAACCAG 


5488 


CTGGTTCA GGCTAGCTACAACGA ATATTCAG 


6476 


3596 


AUAUGUGA A CCAGCCAG 


5489 


CTGGCTGG GGCTAGCTACAACGA TCACATAT 


6477 


3600 


GUGAACCA G CCAGAUGU 


5490 


ACATCTGG GGCTAGCTACAACGA TGGTTCAC 


6478 


3605 


CCAGCCAG A UGUUCGGC 


5491 


GCCGAACA GGCTAGCTACAACGA CTGGCTGG 


6479 


3607 


AGCCAGAU G UUCGGCCC 


5492 


GGGCCGAA GGCTAGCTACAACGA ATCTGGCT 


6480 


3612 


GAUGUUCG G CCCCAGCC 


5493 


GGCTGGGG GGCTAGCTACAACGA CGAACATC 


6481 


3618 


CGGCCCCA G CCCCCUUC 


5494 


GAAGGGGG GGCTAGCTACAACGA TGGGGCCG 


6482 


3627 


CCCCCUUC G CCCCGAGA 


5495 


TCTCGGGG GGCTAGCTACAACGA GAAGGGGG 


6483 


3638 


CCGAGAGG G CCCUCUGC 


5496 


GCAGAGGG GGCTAGCTACAACGA CCTCTCGG 


6484 


3645 


GGCCCUCU G CCUGCUGC 


5497 


GCAGCAGG GGCTAGCTACAACGA AGAGGGCC 


6485 


3649 


CUCUGCCU G CUGCCCGA 


5498 


TCGGGCAG GGCTAGCTACAACGA AGGCAGAG 


6486 


3652 


UGCCUGCU G CCCGACCU 


5499 


AGGTCGGG GGCTAGCTACAACGA AGCAGGCA 


6487 


3657 


GCUGCCCG A CCUGCUGG 


5500 


CCAGCAGG GGCTAGCTACAACGA CGGGCAGC 


6488 


3661 


CCCGACCU G CUGGUGCC 


5501 


GGCACCAG GGCTAGCTACAACGA AGGTCGGG 


6489 


3665 


ACCUGCUG G UGCCACUC 


5502 


GAGTGGCA GGCTAGCTACAACGA CAGCAGGT 


6490 


3667 


CUGCUGGU G CCACUCUG 


5503 


CAGAGTGG GGCTAGCTACAACGA ACCAGCAG 


6491 


3670 


CUGGUGCC A CUCUGGAA 


5504 


TTCCAGAG GGCTAGCTACAACGA GGCACCAG 


6492 


3681 


CUGGAAAG G CCCAAGAC 


5505 


GTCTTGGG GGCTAGCTACAACGA CTTTCCAG 


6493 


3688 


GGCCCAAG A CUCUCUCC 


5506 


GGAGAGAG GGCTAGCTACAACGA CTTGGGCC 


6494 


3707 


AGGGAAGA A UGGGGUCG 


5507 


CGACCCCA GGCTAGCTACAACGA TCTTCCCT 


6495 


3712 


AGAAUGGG G UCGUCAAA 


5508 


TTTGACGA GGCTAGCTACAACGA CCCATTCT 


6496 


3715 


AUGGGGUC G UCAAAGAC 


5509 


GTCTTTGA GGCTAGCTACAACGA GACCCCAT 


6497 


3722 


CGUCAAAG A CGUUUUUG 


5510 


CAAAAACG GGCTAGCTACAACGA CTTTGACG 


6498 


3724 


UCAAAGAC G UUUUUGCC 


5511 


GGCAAAAA GGCTAGCTACAACGA GTCTTTGA 


6499 


3730 


ACGUUUUU G CCUUUGGG 


5512 


CCCAAAGG GGCTAGCTACAACGA AAAAACGT 


6500 


3740 


CUUUGGGG G UGCCGUGG 


5513 


CCACGGCA GGCTAGCTACAACGA CCCCAAAG 


6501 


3742 


UUGGGGGU G CCGUGGAG 


5514 


CTCCACGG GGCTAGCTACAACGA ACCCCCAA 


6502 


3745 


GGGGUGCC G UGGAGAAC 


5515 


GTTCTCCA GGCTAGCTACAACGA GGCACCCC 


6503 


3752 


CGUGGAGA A CCCCGAGU 


5516 


ACTCGGGG GGCTAGCTACAACGA TCTCCACG 


6504 


3759 


AACCCCGA G UACUUGAC 


5517 


GTCAAGTA GGCTAGCTACAACGA TCGGGGTT 


6505 


3761 


CCCCGAGU A CUUGACAC 


5518 


GTGTCAAG GGCTAGCTACAACGA ACTCGGGG 


6506 


3766 


AGUACUUG A CACCCCAG 


5519 


CTGGGGTG GGCTAGCTACAACGA CAAGTACT 


6507 


3768 


UACUUGAC A CCCCAGGG 


5520 


CCCTGGGG GGCTAGCTACAACGA GTCAAGTA 


6508 


3781 


AGGGAGGA G CUGCCCCU 


5521 


AGGGGCAG GGCTAGCTACAACGA TCCTCCCT 


6509 


3784 


GAGGAGCU G CCCCUCAG 


5522 


CTGAGGGG GGCTAGCTACAACGA AGCTCCTC 


6510 


3792 


GCCCCUCA G CCCCACCC 


5523 


GGGTGGGG GGCTAGCTACAACGA TGAGGGGC 


6511 


3797 


UCAGCCCC A CCCUCCUC 


5524 


GAGGAGGG GGCTAGCTACAACGA GGGGCTGA 


6512 


3808 


CUCCUCCU G CCUUCAGC 


5525 


GCTGAAGG GGCTAGCTACAACGA AGGAGGAG 


6513 


3815 


UGCCUUCA G CCCAGCCU 


5526 


AGGCTGGG GGCTAGCTACAACGA TGAAGGCA 


6514 


3820 


UCAGCCCA G CCUUCGAC 


5527 


GTCGAAGG GGCTAGCTACAACGA TGGGCTGA 


6515 


3827 


AGCCUUCG A CAACCUCU 


5528 


AGAGGTTG GGCTAGCTACAACGA CGAAGGCT 


6516 


3830 


CUUCGACA A CCUCUAUU 


5529 


AATAGAGG GGCTAGCTACAACGA TGTCGAAG 


6517 


3836 


CAACCUCU A UUACUGGG 


5530 


CCCAGTAA GGCTAGCTACAACGA AGAGGTTG 


6518 


3839 


CCUCUAUU A CUGGGACC 


5531 


GGTCCCAG GGCTAGCTACAACGA AATAGAGG 


6519 


3845 


UUACUGGG A CCAGGACC 


5532 


GGTCCTGG GGCTAGCTACAACGA CCCAGTAA 


6520 


3851 


GGACCAGG A CCCACCAG 


5533 


CTGGTGGG GGCTAGCTACAACGA CCTGGTCC 


6521 


3855 


CAGGACCC A CCAGAGCG 


5534 


CGCTCTGG GGCTAGCTACAACGA GGGTCCTG 


6522 
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3861 


CCACCAGA G CGGGGGGC 


5535 


GCCCCCCG GG C TAGCTACAACGA TCTGGTGG 


6523 


3868 


AGCGGGGG G CUCCACCC 


5536 


GGGTGGAG GGCTAGCTACAACGA CCCCCGCT 


6524 


3873 


GGGGCUCC A CCCAGCAC 


5537 


GTGCTGGG GGCTAGCTACAACGA GGAGCCCC 


6525 


3878 


UCCACCCA G CACCUUCA 


5538 


TGAAGGTG GGCTAGCTACAACGA TGGGTGGA 


6526 


3880 


CACCCAGC A CCUUCAAA 


5539 


TTTGAAGG GGCTAGCTACAACGA GCTGGGTG 


6527 


3892 


UCAAAGGG A CACCUACG 


5540 


CGTAGGTG GGCTAGCTACAACGA CCCTTTGA 


6528 


3894 


AAAGGGAC A CCUACGGC 


5541 


GCCGTAGG GGCTAGCTACAACGA GTCCCTTT 


6529 


3898 


GGACACCU A CGGCAGAG 


5542 


CTCTGCCG GGCTAGCTACAACGA AGGTGTCC 


6530 


3901 


CACCUACG G CAGAGAAC 


5543 


GTTCTCTG GGCTAGCTACAACGA CGTAGGTG 


6531 


3908 


GGCAGAGA A CCCAGAGU 


5544 


ACTCTGGG GGCTAGCTACAACGA TCTCTGCC 


6532 


3915 


AACCCAGA G UACCUGGG 


5545 


CCCAGGTA GGCTAGCTACAACGA TCTGGGTT 


6533 


3917 


CCCAGAGU A CCUGGGUC 


5546 


GACCCAGG GGCTAGCTACAACGA ACTCTGGG 


6534 


3923 


GUACCUGG G UCUGGACG 


5547 


CGTCCAGA GGCTAGCTACAACGA CCAGGTAC 


6535 


3929 


GGGUCUGG A CGUGCCAG 


5548 


CTGGCACG GGCTAGCTACAACGA CCAGACCC 


6536 


3931 


GUCUGGAC G UGCCAGUG 


5549 


CACTGGCA GGCTAGCTACAACGA GTCCAGAC 


6537 


3933 


CUGGACGU G CCAGUGUG 


5550 


CACACTGG GGCTAGCTACAACGA ACGTCCAG 


6538 


3937 


ACGUGCCA G UGUGAACC 


5551 


GGTTCACA GGCTAGCTACAACGA TGGCACGT 


6539 


3939 


GUGCCAGU G UGAACCAG 


5552 


CTGGTTCA GGCTAGCTACAACGA ACTGGCAC 


6540 


3943 


CAGUGUGA A CCAGAAGG 


5553 


CCTTCTGG GGCTAGCTACAACGA TCACACTG 


6541 


3951 


ACCAGAAG G CCAAGUCC 


5554 


GGACTTGG GGCTAGCTACAACGA CTTCTGGT 


6542 


3956 


AAGGCCAA G UCCGCAGA 


5555 


TCTGCGGA GGCTAGCTACAACGA TTGGCCTT 


6543 


3960 


CCAAGUCC G CAGAAGCC 


5556 


GGCTTCTG GGCTAGCTACAACGA GGACTTGG 


6544 


3966 


CCGCAGAA G CCCUGAUG 


5557 


CATCAGGG GGCTAGCTACAACGA TTCTGCGG 


6545 


3972 


AAGCCCUG A UGUGUCCU 


5558 


AGGACACA GGCTAGCTACAACGA CAGGGCTT 


6546 


3974 


GCCCUGAU G UGUCCUCA 


5559 


TGAGGACA GGCTAGCTACAACGA ATCAGGGC 


6547 


3976 


CCUGAUGU G UCCUCAGG 


5560 


CCTGAGGA GGCTAGCTACAACGA ACATCAGG 


6548 


3987 


CUCAGGGA G CAGGGAAG 


5561 


CTTCCCTG GGCTAGCTACAACGA TCCCTGAG 


6549 


3996 


CAGGGAAG G CCUGACUU 


5562 


AAGTCAGG GGCTAGCTACAACGA CTTCCCTG 


6550 


4001 


AAGGCCUG A CUUCUGCU 


5563 


AGCAGAAG GGCTAGCTACAACGA CAGGCCTT 


6551 


4007 


UGACUUCU G CUGGCAUC 


5564 


GATGCCAG GGCTAGCTACAACGA AGAAGTCA 


6552 


4011 


UUCUGCUG G CAUCAAGA 


5565 


TCTTGATG GGCTAGCTACAACGA CAGCAGAA 


6553 


4013 


CUGCUGGC A UCAAGAGG 


5566 


CCTCTTGA GGCTAGCTACAACGA GCCAGCAG 


6554 


4021 


AUCAAGAG G UGGGAGGG 


5567 


CCCTCCCA GGCTAGCTACAACGA CTCTTGAT 


6555 


4029 


GUGGGAGG G CCCUCCGA 


5568 


TCGGAGGG GGCTAGCTACAACGA CCTCCCAC 


6556 


4037 


GCCCUCCG A CCACUUCC 


5569 


GGAAGTGG GGCTAGCTACAACGA CGGAGGGC 


6557 


4040 


CUCCGACC A CUUCCAGG 


5570 


CCTGGAAG GGCTAGCTACAACGA GGTCGGAG 


6558 


4052 


CCAGGGGA A CCUGCCAU 


5571 


ATGGCAGG GGCTAGCTACAACGA TCCCCTGG 


6559 


4056 


GGGAACCU G CCAUGCCA 


5572 


TGGCATGG GGCTAGCTACAACGA AGGTTCCC 


6560 


4059 


AACCUGCC A UGCCAGGA 


5573 


TCCTGGCA GGCTAGCTACAACGA GGCAGGTT 


6561 


4061 


CCUGCCAU G CCAGGAAC 


5574 


GTTCCTGG GGCTAGCTACAACGA ATGGCAGG 


6562 


4068 


UGCCAGGA A CCUGUCCU 


5575 


AGGACAGG GGCTAGCTACAACGA TCCTGGCA 


6563 


4072 


AGGAACCU G UCCUAAGG 


5576 


CCTTAGGA GGCTAGCTACAACGA AGGTTCCT 


6564 


4082 


CCUAAGGA A CCUUCCUU 


5577 


AAGGAAGG GGCTAGCTACAACGA TCCTTAGG 


6565 


4094 


UCCUUCCU G CUUGAGUU 


5578 


AACTCAAG GGCTAGCTACAACGA AGGAAGGA ! 


6566 


4100 


CUGCUUGA G UUCCCAGA 


5579 


TCTGGGAA GGCTAGCTACAACGA TCAAGCAG 


6567 


4108 


GUUCCCAG A UGGCUGGA 


5580 


TCCAGCCA GGCTAGCTACAACGA CTGGGAAC 


6568 


4111 


CCCAGAUG G CUGGAAGG 


5581 


CCTTCCAG GGCTAGCTACAACGA CATCTGGG 


6569 


4121 


UGGAAGGG G UCCAGCCU 


5582 


AGGCTGGA GGCTAGCTACAACGA CCCTTCCA 


6570 


4126 


GGGGUCCA G CCUCGUUG 


5583 


CAACGAGG GGCTAGCTACAACGA TGGACCCC 


6571 


4131 


CCAGCCUC G UUGGAAGA 


5584 


TCTTCCAA GGCTAGCTACAACGA GAGGCTGG 


6572 


4143 


GAAGAGGA A CAGCACUG 


5585 


CAGTGCTG GGCTAGCTACAACGA TCCTCTTC 


6573 


4146 


GAGGAACA G CACUGGGG 


5586 


CCCCAGTG GGCTAGCTACAACGA TGTTCCTC 


6574 
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4148 


p'P' a a p" a p i p i 7v /it tp 1 P'P 1 p* 7v p* 
GGAACAGC A CUGGGGAG 


5587 


GAGGGCAG GGL1AGGIAGAAGGA GGlGIiGG 


6575 


4156 


7\ /™TT TP , P , P , P < A PI TTP'TTTTTTP'TTPt 

ACUGGGGA G UCUUUGUG 


5588 


PAP717\HPT\ PPPTTiPOTHPARPPTi TPPPPTin 1 

GAG AAAGA GGL 1 AGG 1 AGAAGGA I GGGGAG X 


6576 


/in o 


GAV3UL.UUU G UbuAUULU 


5589 


SPSTiTPPA PPPTAPPTS P 1 A A /T" 1 A A A A P> A prpP 

ALjAAIGLA GGL- lAuLl AGAAGGA AAAGAV-IL. 


6577 


4166 


P*TTTTTTP^f TP1P» TV T TT TPT TP 1 TV pi P* 

CUUUGUGG A UUCUGAGG 


5590 


PPTPTlPTlTi PPPTAPPTTiPTi^PPTi PP7lP7\ ASP 

CGIGAGAA GGCTAGCTACAACGA GCACAAAG 


6578 


4174 


7\ T Tr/TPr TP< 7\ P* P" {TT*n f /~T^f~* 

AUUCUGAG G CCCUGCCC 


5591 


P'P'P'P'A P'P'P' PPPTAPPfp7\PH7lPP7\ PTPAPTIAT 

GGGGAGGG GGGIAGGTAGAAGGA L1CAGAAI 


6579 


4J. /y 


r~i t\ r^r^r^r^r^xT r* p'p'p" a at tp 1 a 
IjAuoLLLU G CCCAAUGA 


5592 


TPATTPPP PPPTAPPTTiPAAPPTi TiPPPPPTP 

1GA11GGG GGC1 AGG 1 AGAAGGA AGGGGG1C 


6580 


A ~\ Q A 

4±o4 


P'P'T Tr"PPP7i a t ir~t a P 1 a m tp 1 
CCUGCCCA A UGAvjAGUL. 


5593 


C* A PTPTPA PPPfpAPPfpAPTiTkPPli rp/-iy-i/i (OTV P»P» 

vjAVjIuICA GGG1AGL.1 AGAAGGA 1GGGGAGG 


6581 


/TOO 


p'P'a at/tp 1 ap 1 7\ p*r tp'TT a p^p 1 
LLAA U GAG A C U C U AGGG 


5594 


pppmjip7ip PPPTBPPTTtPATiPPR PTPTiTTPP 

GGG1AGAG GGL. I AGG 1 AGAAGGA CJ.GA11GG 


6582 


4197 


A P T fTP , TTAP<P» P< T TP" P 1 A P*! TP'P* 

AGUGUAGG G UGGAIjUGG 


5595 


PPTiPTPPA /^P I P"T , AP 1 P"T 1 A PA A PV*" A PPTAPTVPT* 

GGAG X GGA GG L 1 AGG 1 AGAAGGA L C JL AGAG J. 


6583 


4 Z UZ 


7ap2P ,, p i ' t tp<pia P 1 Ticcmit^rr* 
AGGGUGGA \j UGoAUGGG 


5596 


P'P 1 P'A r rp'r»A PPPTAPPTA P'A A PT 11 A TPPTaPPPT 1 

GvjvJAXGGA GGL. IAvjL.1 AGAAGGA JLGGAGGGJ. 


6584 


A o n £T 

4 Z Ub 


UGGAGUGG A Ubrt-tJAGAG 


5597 


P"PP lr PP , P»P'A PPPTAPPTAPAAPPA PPAPTPPTA 

GIGiGGLA GGG I AGG 1 AGAAGGA CCAG1GGA 


6585 


4208 


rtTVPTTPCTkTT P* P>/"'AP , 7\P'P'P< 

CAGUGGAU G CCACAGCC 


5598 


/-irirtmpmpri pripmiv ri P«T> APA APPA A T P< P> A P"T«/-1 

GGCTGTGG GGCTAGCTACAACGA ATCCACTG 


6586 


4211 


UGGAUGCC A GAGGGCAG 


5599 


P"T , P'P 1 P'P"T'P» PPPfPAPPTAPAAPPA P'P'P'AT'P'P'A 

GIGGGCTG GGC1AGGTACAACGA GGCA1CCA 


6587 


4214 


attp*p«pta pia p» p»p»piap»p«tttt 
AUGCCACA G CCCAGCUU 


5600 


A APPTPPP PPPHP APPTiTi PTi A PP A TPTPPPTt T* 

AAGCTGGG GGCTAGCTACAACGA TGTGGCAT 


6588 




nrApnppn p 1 p , ttttpip»pip'pi 
AGAGGGGA G CUUGGCCC 


5601 


PPPPPRAP pppmnpp»riT\ P> A A C(~*7\ TP 1 P* P 1 P"T > P 1 T* 

GGGCGAAG GGL 1 AGG I AGAAGGA 1GGGG1G1 


6589 


4224 


P'P>7V P'P'T TT TP* P» P'P'P'T TTTT TP<pi 

CCAGCUUG G CCCUUUCC 


5602 


GGAAAGGG GGCTAGCTACAACGA CAAGCTGG 


6590 


4239 


CCUUCCAG A UCCUGGGU 


5603 


A P»PTP»A PIP* A pppmiippmTipA APPJV P"T>OP«TV TV P»P» 

ACCCAGGA GGCTAGCTACAACGA CTGGAAGG 


6591 


4246 


PI 7V TT/^/*ltTr<0 P* T T 7V PI TPI 71 TV TV 

GAUCCUGG G UACUGAAA 


5604 


TTTCAGTA GGCTAGCTACAACGA CCAGGATC 


6592 


4248 


UCCUGGGU A CUGAAAGC 


5605 


f\ /*1IT»Tini OA P PPpmAPPmAPATlPPA TV P1J~1/T>1 riPA 

GCTTTCAG GGCTAGCTACAACGA ACCCAGGA 


6593 


4255 


T T T\ /TJ T<^t TV T\ TV f~t A~<T TT T7\ /™»/T/~< 

UAGUGAAA G CCUUAGGG 


5606 


CCCTAAGG GGCTAGCTACAACGA TTTCAGTA 


6594 


4266 


uuagggaa G CUGGCCUG 


5607 


CAGGCCAG GGCTAGCTACAACGA TTCCCTAA 


6595 


4270 


ggaagcug G CCUGAGAG 


5608 


CTCTCAGG GGCTAGCTACAACGA CAGCTTCC 


6596 


4284 


PI 7V Pt PI P»P« Ti TV PI PIPIPtPiPlPITTTV 

gaggggaa g cggcccua 


5609 


TAGGGCCG GGCTAGCTACAACGA TTCCCCTC 


6597 


4287 


gggaagcg g cccuaagg 


5610 


CCTTAGGG GGCTAGCTACAACGA CGCTTCCC 


6598 


4298 


cuaaggga g ugucuaag 


5611 


CTTAGACA GGCTAGCTACAACGA TCCCTTAG 


6599 


4300 


aagggagu g ucuaagaa 


5612 


TTCTTAGA GGCTAGCTACAACGA ACTCCCTT 


6600 


4308 


gucuaaga a caaaagcg 


5613 


CGCTTTTG GGCTAGCTACAACGA TCTTAGAC 


6601 


4314 


GAACAAAA g cgacccau 


5614 


ATGGGTCG GGCTAGCTACAACGA TTTTGTTC 


6602 


4317 


T\ TV TV TV v*i /^l/ - ^ TV i^l /~t TV T TT TV 

caaaagcg a cccauuca 


5615 


TGAATGGG GGCTAGCTACAACGA CGCTTTTG 


6603 


4321 


agcgaccc a uucagaga 


5616 


TCTCTGAA GGCTAGCTACAACGA GGGTCGCT 


6604 


4329 


7\ T TT Tf TV /**! TV f** TV /"IT Ty"1T T/~1 /T AIT T 

AUUCAGAG A CUGUCCCU 


5617 


AGGGACAG GGCTAGCTACAACGA CTCTGAAT 


6605 


4332 


t~% T\ A~1 TV TV /TT T /T TTO/^/^TTn T\ TV 

CAGAGACU G UCCCUGAA 


5618 


TTCAGGGA GGCTAGCTACAACGA AGTCTCTG 


6606 


4341 


UCCCUGAA A CCUAGUAC 


5619 


GTACTAGG GGCTAGCTACAACGA TTCAGGGA 


6607 


4346 


f-% T\ TV TV ririT TT\ /""I T Y TV /*tT T/^ /T^T/T 

GAAACCUA G UACUGCCC 


5620 


GGGCAGTA GGCTAGCTACAACGA TAGGTTTC 


6608 


4348 


AACCUAGU A CUGCCCCC 


5621 


PIPIPIpl/tPITX /^PIOITITV P»PWT»TV P1TV TV /nP^TV TV Ptm TV PI PIITim 

GGGGGCAG GGCTAGCTACAACGA ACTAGGTT 


6609 


4351 


CUAGUACU G CCCCCCAU 


5622 


ATGGGGGG GGCTAGCTACAACGA AGTACTAG 


6610 


4358 


UGCCCCCC A UGAGGAAG 


5623 


CTTCCTCA GGCTAGCTACAACGA GGGGGGCA 


6611 


4369 


TV r\ TV TV ^T TV TV /^t TV /^t y*1 7i TV T 

AGGAAGGA A CAGCAAuG 


5624 


CATTGCTG GGCTAGCTACAACGA TCCTTCCT 


6612 


4372 


7A TV TV T\ /~\ TV j^I /T TV TV T T/*1/*t)T T/^1 

AAGGAACA G CAAUGGUG 


5625 


CACCATTG GGCTAGCTACAACGA TGTTCCTT 


6613 


4375 


GAACAGCA A UGGUGUCA 


5626 


TGACACCA GGCTAGCTACAACGA TGCTGTTC 


6614 


4378 


CAGCAAUG G UGUCAGUA 


5627 


TACTGACA GGCTAGCTACAACGA CATTGCTG 


6615 


4380 


GCAAUGGU G UCAGUAUC 


5628 


GATACTGA GGCTAGCTACAACGA ACCATTGC 


6616 


4384 


UGGUGUCA G UAUCCAGG 


5629 


CCTGGATA GGCTAGCTACAACGA TGACACCA 


6617 


4386 


GUGUCAGU A UCCAGGCU 


5630 


AGCCTGGA GGCTAGCTACAACGA ACTGACAC 


6618 


4392 


GUAUCCAG G CUUUGUAC 


5631 


GTACAAAG GGCTAGCTACAACGA CTGGATAC 


6619 


4397 


CAGGCUUU G UACAGAGU 


5632 


ACTCTGTA GGCTAGCTACAACGA AAAGCCTG 


6620 


4399 


GGCUUUGU A CAGAGUGC 


5633 


GCACTCTG GGCTAGCTACAACGA ACAAAGCC 


6621 


4404 


UGUACAGA G UGCUUUUC 


5634 


GAAAAGCA GGCTAGCTACAACGA TCTGTACA 


6622 


4406 


UACAGAGU G CUUUUCUG 


5635 


CAGAAAAG GGCTAGCTACAACGA ACTCTGTA 


6623 


4414 


GCUUUUCU G UUUAGUUU 


5636 


AAACTAAA GGCTAGCTACAACGA AGAAAAGC 


6624 


4419 


UCUGUUUA G UUUUUACU 


5637 


AGTAAAAA GGCTAGCTACAACGA TAAACAGA 


6625 


4425 


UAGUUUUU A CUUUUUUU 


5638 


AAAAAAAG GGCTAGCTACAACGA AAAAACTA 


6626 
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4434 


CUUUUUUU G UUUUGUUU 


5639 


AAACAAAA GG CTAGCTACAACGA AAAAAAAG 


6627 


4439 


UUUGUUUU G UUUUUUUA 


5640 


TAAAAAAA GGCTAGCTACAACGA AAAACAAA 


6628 


4451 


UUUUAAAG A UGAAAUAA 


5641 


TTATTTCA GGCTAGCTACAACGA CTTTAAAA 


6629 


4456 


AAGAUGAA A UAAAGACC 


5642 


GGTCTTTA GGCTAGCTACAACGA TTCATCTT 


6630 


4462 


AAAUAAAG A CCCAGGGG 


5643 


CCCCTGGG GGCTAGCTACAACGA CTTTATTT 


6631 



Input Sequence = HSERB2R. Cut Site « R/Y 

Arm Length = 8. Core Sequence = GGCTAGCTACAACGA 

HSERB2R (Human c-erb-B-2 mRNA; 4473 bp) 
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Table V: Human HER2 Synthetic DNAzyme and Target molecules 



Gene 


Pos 


Target 


Seq 
ID 


RP!# 


DNAzyme 


Seq ID 


erbB2 


377 


CCACCA A UGCCAG 


6632 


24998 


cuggca GGCTAGCTACAACGA uggugg B 


6637 


erbB2 


766 


UUCUCCG A UGUGUAA 


6633 


24999 


uuacaca GGCTAGCTACAACGA cggagaa B 


6638 


erbB2 


1202 


UGUGCU A UGGUCU 


6634 


25000 


agacca GGCTAGCTACAACGA agcaca B 


6639 


erbB2 


1444 


CCUCAGC G UCUUCCA 


6635 


25001 


uggaaga GGCTAGCTACAACGA gcugagg B 


6640 


erbB2 


1583 


AUCCACC A UAACACC 


6636 


25002 


gguguua GGCTAGCTACAACGA gguggau B 


6641 



A, G, C, T (italic) = deoxy 
lower case = 2'-0-methyl 
B = inverted deoxyabasic derivative 
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Table VI: Human HIV Hammerhead Ribozyme and Substrate Sequence 



Substrate 


Seq 
ID 


Hammerhead 


Seq 
ID 


AUAAAGCU U GCCUUGAG 


6642 


CUCAAGGC CUGAUGAGGCCGUUAGGCCGAA AGCUUUAU 


6727 


AGGCUAAU U UUUUAGGG 


6643 


CCCUAAAA CUGAUGAGGCCGUUAGGCCGAA AUUAGCCU 


6728 


GGCUAAUU U UUUAGGGA 


6644 


UCCCUAAA CUGAUGAGGCCGUUAGGCCGAA AAUUAGCC 


6729 


GCCUCAAU A AAGCUUGC 


6645 


GCAAGCUU CUGAUGAGGCCGUUAGGCCGAA AUUGAGGC 


6730 


UUUCGGGU U UAUUACAG 


6646 


CUGUAAUA CUGAUGAGGCCGUUAGGCCGAA ACCCGAAA 


6731 


GCAGGACU C GGCUUGCU 


6647 


AGCAAGCC CUGAUGAGGCCGUUAGGCCGAA AGUCCUGC 


6732 



Input Sequence = HIV1 . Cut Site = UH/ . 

Arm Length = 8. Core Sequence = CUGAUGAG GCCGUUAGGC CGAA 
HIVl Consensus 



Underlined region can be any X sequence or linker, as described herein. 
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Table VH: Human HIV Inozyme and Substrate Sequence 



Substrate 


Seq 
ID 


Inozyme 


Seq 
ID 


UGGAAAAC A GAUGGCAG 


6648 


CUGCCAUC CUGAUGAGGCCGUUAGGCCGAA IUUUUCCA 


6733 


AAUAAAGC U UGCCUUGA 


6649 


UCAAGGCA CUGAUGAGGCCGUUAGGCCGAA ICUUUAUU 


6734 


UCUCUAGC A GUGGCGCC 


6650 


GGCGCCAC CUGAUGAGGCCGUUAGGCCGAA ICUAGAGA 


6735 


GGAGCCAC C CCACAAGA 


6651 


UCUUGUGG CUGAUGAGGCCGUUAGGCCGAA IUGGCUCC 


6736 


AGUGGCGC C CGAACAGG 


6652 


CCUGUUCG CUGAUGAGGCCGUUAGGCCGAA ICGCCACU 


6737 


GUGGCGCC C GAACAGGG 


6653 


CCCUGUUC CUGAUGAGGCCGUUAGGCCGAA IGCGCCAC 


6738 


CUCGACGC A GGACUCGG 


6654 


CCGAGUCC CUGAUGAGGCCGUUAGGCCGAA ICGUCGAG 


6739 


CGCAGGAC U CGGCUUGC 


6655 


GCAAGCCG CUGAUGAGGCCGUUAGGCCGAA IUCCUGCG 


6740 



Input Sequence =» HIV1 . Cut Site = CH/*. 

Arm Length = 8. Core Sequence = CUGAUGAG GCCGUUAGGC CGAA 
HIV1 Consensus 



Underlined region can be any X sequence or linker, as described herein. 
"I" stands for Inosine. 
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Table VIII: Human HIV Zinzyme and Substrate Sequence 



Substrate 


Seq 
ID 


Zinzyme 


Seq 
ID 


UCAAUAAA G CUUGCCUU 


6656 


AAGGCAAG GCCGAAAGGCGAGUGAGGUCU UUUAUUGA 


6741 


AGGACUCG G CUUGCUGA 


6657 


UCAGCAAG GCCGAAAGGCGAGUGAGGUCU CGAGUCCU 


6742 


GCAGUGGC G CCCGAACA 


6658 


UGUUCGGG GCCGAAAGGCGAGUGAGGUCU GCCACUGC 


6743 


CUCUAGCA G UGGCGCCC 


6659 


GGGCGCCA GCCGAAAGGCGAGUGAGGUCU UGCUAGAG 


6744 


UAGCAGUG G CGCCCGAA 


6660 


UUCGGGCG GCCGAAAGGCGAGUGAGGUCU CACUGCUA 


6745 


AGAGAUGG G UGCGAGAG 


6661 


CUCUCGCA GCCGAAAGGCGAGUGAGGUCU CCAUCUCU 


6746 


AGAUGGGU G CGAGAGCG 


6662 


CGCUCUCG GCCGAAAGGCGAGUGAGGUCU ACCCAUCU 


6747 


CUCUCGAC G CAGGACUC 


6663 


GAGUCCUG GCCGAAAGGCGAGUGAGGUCU GUCGAGAG 


6748 



Input Sequence = HIV1. Cut Site = G/Y 

Arm Length =» 8 . Core Sequence « GCcgaaagGCGaGuCaaGGuCu 
HIV1 Consensus 
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Table IX: Human HIV DNAzyme and Substrate Sequence 



Substrate 


Seq 
ID 


DNAzyme 


Seq 
ID 


UCAAUAAA G CUUGCCUU 


6656 


AAGGCAAG GG CTAGCTACAACGA TTTATTGA 


6749 


AGGACUCG G CUUGCUGA 


6657 


TCAGCAAG GGCTAGCTACAACGA CGAGTCCT 


6750 


GCAGUGGC G CCCGAACA 


6658 


TGTTCGGG GGCTAGCTACAACGA GCCACTGC 


6751 


CUCUAGCA G UGGCGCCC 


6659 


GGGCGCCA GGCTAGCTACAACGA TGCTAGAG 


6752 


UAGCAGUG G CGCCCGAA 


6660 


TTCGGGCG GGCTAGCTACAACGA CACTGCTA 


6753 


AGAGAUGG G UGCGAGAG 


6661 


CTCTCGCA GGCTAGCTACAACGA CCATCTCT 


6754 


AGAUGGGU G CGAGAGCG 


6662 


CGCTCTCG GGCTAGCTACAACGA ACCCATCT 


6755 


CUCUCGAC G CAGGACUC 


6663 


GAGTCCTG GGCTAGCTACAACGA GTCGAGAG 


6756 


UAUGGAAA A CAGAUGGC 


6664 


GCCATCTG GGCTAGCTACAACGA TTTCCATA 


6757 


GAAAACAG A UGGCAGGU 


6665 


ACCTGCCA GGCTAGCTACAACGA CTGTTTTC 


6758 


AAGCCUCA A UAAAGCUU 


6666 


AAGCTTTA GGCTAGCTACAACGA TGAGGCTT 


6759 


GGAGAGAG A UGGGUGCG 


6667 


CGCACCCA GGCTAGCTACAACGA CTCTCTCC 


6760 


GACGCAGG A CUCGGCUU 


6668 


AAGCCGAG GGCTAGCTACAACGA CCTGCGTC 


6761 



Input Sequence = HIV1 . Cut Site = R/Y 

Arm Length n 8 . Core Sequence = GGCTAGCTACAACGA 

HIV1 Consensus 
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Table XI: Human HIV Enzymatic Nucleic Acid and Target molecules 



Target 


Seq ID 


RPI# 


Enzymatic Nucleic Acid 


Seq ID 


GAGAUGG G UGCGAGA 


6718 


25003 


ucucgca GGCTAGCTACAACGA ccaucuc B 


6790 


AUGGAAAACAGAUGG 


6719 


25004 


ccaucug GGCTAGCTACAACGA uuuccau B 


6791 


AAAACAG A UGGCAGG 


6720 


25005 


ccugcca GGCTAGCTACAACGA cuguuuu B 


6792 


AGCCUCA A UAAAGCU 


6721 


25006 


agcuuua GGCTAGCTACAACGA ugaggcu B 


6793 


GAGAGAG A UGGGUGC 


6722 


25007 


gcaccca GGCTAGCTACAACGA cucucuc B 


6794 


CAAUAAA G CUUGCCU 


6723 


25008 


aggcaag gccgaaaggCgagugaGGuCu uuuauug B 


6795 


GGACUCG G CUUGCUG 


6724 


25009 


cagcaag gccgaaaggCgagugaGGuCu cgagucc B 


6796 


GAGAUGG G UGCGAGA 


6718 


25010 


ucucgca gccgaaaggCgagugaGGuCu ccaucuc B 


6797 


GAUGGGU G CGAGAGC 


6725 


25011 


gcucucg gccgaaaggCgagugaGGuCu acccauc B 


6798 


UCUCGAC G CAGGACU 


6726 


25012 


aguccug gccgaaaggCgagugaGGuCu gucgaga B 


6799 



G = Guanosine 

A, G, C, T (italic) = deoxy 

lower case = 2'-0-methyl 

s = phosphorothioate 3'-internucleotide linkage 

C = 2'-deoxy-2'-Amino cytidine 

B = inverted deoxyabasic derivative 
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Table XII: Human HIV-1 Sequences 



vjci i udi 1 1\ 

Acc# 


Sea Namefsl 


SubtvDG 


Oraanism 


A04321 


MB LAI 


B 


HIV-1 


AF 110962 


96BW0402 


C 


HIV-1 


AF1 10963 


96BW0407 


C 


HIV-1 


AF1 10968 


96BW0504 


C 


HIV-1 


AF1 10965 


96BW0409 


c 


HIV-1 


AF1 10966 


96BW0410 


c 


HIV-1 


AF1 10964 


96BW0408 


c 


HIV-1 


AF1 10975 


96BW15C05 


c 


HIV-1 


AF1 10974 


96BW15C02 


c 


HIV-1 


AF1 10973 


96BW15B03 


c 


HIV-1 


AF1 07771 


UGSE8131 


A 


HIV-1 


U69585 


WCIPR854 


B 


HIV-1 


U69588 


WCIPR855 


B 


HIV-1 t 


U69589 


WCIPR9011 


B 


HIV-1 


U69591 


WCIPR9018 


B 


HIV-1 


U69592 


WCIPR9031 


B 


HIV-1 


U69593 


WCIPR9032 


B 


HIV-1 


U69586 


WCIPR8546 


B 


HIV-1 


AF003888 


NL43WC001 


B 


HIV-1 


X01762 


REHTLV3 LAI NIB 


B 


HIV-1 


AF075719 


MNTQ MNcloneTQ 


B 


HIV-1 


AJ239083 


97CAMP645MO 


MO 


HIV-1 


D86069 


PM213 


B 


HIV-1 


K02083 


PV22 


B 


HIV-1 


M93259 


YU10 


B 


HIV-1 


Z11530 


F12CG 


B 


HIV-1 


AB032740 


TH022 95TNIH022 


CRF01_AE 


HIV-1 


AF1 07770 


SE7812 


CRF02_AG 


HIV-1 


AF070521 


NL43E9 


B 


HIV-1 


AF033819 


HXB2-copy LAI 


B 


HIV-1 


AF003887 


WC001 


B 


HIV-1 


AF069140 


DH123 


B 


HIV-1 


AF1 10967 


96BW0502 


C 


HIV-1 


K03455 


HXB2 HXB2CG 


B 


HIV-1 


M96155 


P896 89.6 


B 


HIV-1 


X04415 


MAL MALCG 


ADK 


HIV-1 


AF1 33821 


MB2059 


D 


HIV-1 


D86068 


MCK1 


B 


HIV-1 


U69587 


WCIPR8552 


B 


HIV-1 


U69590 


WCIPR9012 


B 


HIV-1 


AB032741 


95TNIH047 TH047 


CRF01_AE 


HIV-1 


AB023804 


93IN101 


C 


HIV-1 


AF1 93275 


97BL006 


A 


HIV-1 


AF1 97340 


90CF11697 


CRF01_AE 


HIV-1 


AF224507 


WK 


! B 


HIV-1 


AJ271445 


GB8 GB8-46R 


i B 


HIV-1 


AF1 97338 


93TH057 


CRF01_AE 


HIV-1 


AF1 97339 


93TH065 


CRF01_AE 


HIV-1 


AF197341 


90CF4071 


CRF01_AE 


HIV-1 



wo 
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U69584 


85WCIPR54 


B 


HIV-1 | 


L31963 


TH475A LAI 


B 


HIV-1 


U46016 


ETH2220 C2220 


C 


HIV-1 


U21135 


WEAU160 GHOSH 


B 


HIV-1 


AF042106 


MBCC18R01 


B 


HIV-1 


K03454 


ELI 


D 


HIV-1 


U51188 


90CF402 90CR402 


CRF01J\E 


HIV-1 


U51189 


93TH253 


CRF01_AE 


HIV-1 


U34603 


H0320-2A12 


B 


HIV-1 


M38429 


JRCSFJR-CSF 


B 


HIV-1 


M17451 


RF HAT3 


B 


HIV-1 


L02317 


BC BCSG3 


B 


HIV-1 


M93258 


YU2 YU2X 


B 


HIV-1 


M22639 


Z2Z6 Z2 CDC-Z34 


D 


HIV-1 


AF004394 


AD8, AD87 ADA 


B 


HIV-1 


AF049337 


94CY032-3 


CRF04_cpx 


HIV-1 


U34604 


3202A21 


B 


HIV-1 


L20587 


ANT70 


0 


HIV-1 


D10112 


CAM1 


B 


HIV-1 


U54771 


CM240 


CRF01_AE 


HIV-1 


U43096 


D31 


B 


HIV-1 


U37270 


C18MBC 


B 


HIV-1 


U43141 


HAN 


B 


HIV-1 


U23487 


MANC 


B 


HIV-1 


M17449 


MNCG MN 


B 


HIV-1 


L20571 


MVP5180 


0 


HIV-1 


M27323 


NDK 


D 


HIV-1 


M38431 


NY5CG 


B 


HIV-1 


M26727 


OYI, 397 


B 


HIV-1 


K02007 


SF2 LAV2 ARV2 


B 


HIV-1 


M62320 


U455 U455A 


A 


HIV-1 


U26546 


WR27 


B 


HIV-1 


AF004885 


Q23 


A 


HIV-1 


AF042100 


MBC200 


B 


HIV-1 


AF042101 


MBC925 


B 


HIV-1 


AJ006287 


89SP061 89ES061 


B 


HIV-1 


AF067154 


93IN999 301999 


C 


HIV-1 


AF067155 


95IN21068 21068 


C 


HIV-1 


AJ006022 


YBF30 


N 


HIV-1 


AF061642 


SE6165 G6165 


G 


HIV-1 


AF1 19820 


97PVCH GR11 


CRF04_cpx 


HIV-1 


AF1 19819 


97PVMY GR84 


CRF04_cpx 


HIV-1 


K02013 


LAI BRU 


B 


HIV-1 


L39106 


IBNG 


CRF02_AG 


HIV-1 


U12055 


LW123 


B 


HIV-1 


M19921 


NL43 pNL43 


B 


HIV-1 


AF061640 


HH8793-1.1 


G 


HIV-1 


AF061641 


HH8793-12.1 


G 


HIV-1 


AF063223 


DJ263 


CRF02_AG 


HIV-1 


AF049495 


NC7 


B 


HIV-1 


AF049494 


499JC16 


B 


HIV-1 


AF086817 


TWCYS LM49 
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CLAIMS 

What we claim is: 

1. A siRNA nucleic acid molecule that modulates expression of a nucleic acid 
molecule encoding HER2. 

2. A enzymatic nucleic acid molecule that modulates expression of a nucleic 
acid molecule encoding HER2. 

3. An enzymatic nucleic acid molecule comprising a sequence selected from the 
group consisting of SEQ ID NOs: 5644-6631 and 6637-6641. 

4. An enzymatic nucleic acid molecule comprising at least one binding arm 
wherein one or more of said binding arms comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID 
NOs: 4656-5643 and 6632-6636. 

5. A siRNA nucleic acid molecule comprising a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NOs: 4656-5643 and 
6632-6636. 

6. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule is adapted to treat cancer. 

7. The enzymatic nucleic acid molecule of any of claims 2-4, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
having HER2 sequence. 

8. The enzymatic nucleic acid molecule of claim 2, wherein said enzymatic 
nucleic acid molecule is a DNAzyme in a 10-23 configuration. 

9. The enzymatic nucleic acid molecule of claim 8, wherein said enzymatic 
nucleic acid molecule comprises a sequence complementary to a sequence 
selected from the group consisting of SEQ ID NOs: 4656-5643 and 6632- 
6636. 
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10. The enzymatic nucleic acid molecule of claim 8, wherein said enzymatic 
nucleic acid molecule comprises a sequence selected from the group 
consisting of SEQ ID NOs: 5644-6631 and 6637-6641. 

11. The nucleic acid molecule of any of claims 1, 2, 4 or 5, wherein said nucleic 
acid molecule comprises between 12 and 100 bases complementary to a 
RNA having HER2 sequence. 

12. The nucleic acid molecule of claim of any of claims 1, 2, 4 or 5, wherein said 
nucleic acid molecule comprises between 14 and 24 bases complementary to 
a RNA having HER2 sequence. 

13. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule is chemically synthesized. 

14. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises at least one 2 , -sugar modification. 

15. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises at least one nucleic acid base modification. 

16. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises at least one phosphate backbone modification. 

17. A mammalian cell comprising the nucleic acid molecule of any of claims 1- 
5. 

18. The mammalian cell of claim 17, wherein said mammalian cell is a human 
cell. 

19. A method of reducing HER2 activity in a cell, comprising contacting said 
cell with the nucleic acid molecule of any of claims 1-5, under conditions 
suitable for said reduction of HER2 activity. 

20. A method of treatment of a subject having a condition associated with the 
level of HER2, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 1-5, under conditions suitable for said 
treatment. 

21. The method of claim 20 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 
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22. A method of cleaving RNA having HER2 sequence comprising contacting 
an enzymatic nucleic acid molecule of any of claims 2-4 with said RNA 
under conditions suitable for the cleavage. 

23. The method of claim 22, wherein said cleavage is carried out in the presence 
of a divalent cation. 

24. The method of claim 23, wherein said divalent cation is Mg2+ 

25. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises a cap structure, wherein the cap structure is at the 5'- 
end, 3 '-end, or both the 5 '-end and the 3 '-end of said nucleic acid molecule. 

26. The nucleic acid molecule of claim 25, wherein the cap structure at the 5'- 
end, 3 '-end, or both the 5 '-end and the 3 '-end comprises a 3', 3 '-linked or 
5',5 '-linked deoxyabasic ribose derivative. 

27. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 1-5 in a manner that allows 
expression of the nucleic acid molecule. 

28. A mammalian cell comprising an expression vector of claim 27. 

29. The mammalian cell of claim 28, wherein said mammalian cell is a human 
cell. 

30. The expression vector of claim 27, wherein said nucleic acid molecule is in a 
DNAzyme configuration. 

31. The expression vector of claim 27, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to a 
nucleic acid molecule having HER2 sequence. 

32. The expression vector of claim 27, wherein said expression vector comprises 
a nucleic acid sequence encoding two or more of said nucleic acid molecules, 
which may be the same or different. 

33. The expression vector of claim 32, wherein said expression vector further 
comprises a sequence encoding an antisense nucleic acid molecule or siRNA 
molecule complementary to a nucleic acid molecule having HER2 sequence. 
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34. A method for treatment of cancer comprising administering to a subject the 
nucleic acid molecule of any of claims 1-5 under conditions suitable for said 
treatment. 

35. The method of claim 34, wherein said cancer is breast cancer. 

36. The method of claim 34, wherein said cancer is ovarian cancer. 

37. The method of claim 34, wherein said method further comprises 
administering to said subject one or more other therapies under conditions 
suitable for said treatment. 

38. The method of claim 21 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

39. The method of claim 37 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

40. The method of claim 38, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

41. The method of claim 38, wherein said monoclonal antibody is Herceptin 
(trastuzumab). 

42. The method of claim 39, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

43. The method of claim 39, wherein said monoclonal antibody is Herceptin 
(trastuzumab). 

44. A composition comprising a nucleic acid molecule of any of claims 1-5 in a 
pharmaceutically acceptable carrier. 
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45 . A method of administering to a cell a nucleic acid molecule of any of claims 
1-5 comprising contacting said cell with the nucleic acid molecule under 
conditions suitable for said administration. 

46. The method of claim 45, wherein said cell is a mammalian cell. 

47. The method of claim 45, wherein said cell is a human cell. 

48. The method of claim 45, wherein said administration is in the presence of a 
delivery reagent. 

49. The method of claim 48, wherein said delivery reagent is a lipid. 

50. The method of claim 49, wherein said lipid is a cationic lipid. 

5 1 . The method of claim 49, wherein said lipid is a phospholipid. 

52. The method of claim 48, wherein said delivery reagent is a liposome. 

53. A siRNA nucleic acid molecule that modulates expression of a nucleic acid 
molecule encoding K-Ras. 

54. A siRNA nucleic acid molecule that modulates expression of a nucleic acid 
molecule encoding H-Ras or N-Ras. 

55. An enzymatic nucleic acid molecule that modulates expression of a nucleic 
acid molecule encoding K-Ras. 

56. An enzymatic nucleic acid molecule that moduates expression of a nucleic 
acid molecule encoding H-Ras or N-Ras. 

57. An enzymatic nucleic acid molecule comprising a sequence of SEQ ID NOs: 
2329-4655. 

58. An enzymatic nucleic acid molecule comprising at least one binding arm 
wherein one or more of said binding arms comprises a sequence 
complementary to a sequence of SEQ ID NOs: 1-2328. 



59. 



A siRNA nucleic acid molecule comprising a sequence complementary to a 
sequence of SEQ ID NOs: 1-2328. 
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60. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule is adapted to treat cancer. 

61 . The enzymatic nucleic acid molecule of any of claims 55, 57 or 58, wherein 
said enzymatic nucleic acid molecule has an endonuclease activity to cleave 
RNA having a K-Ras sequence. 

62. The enzymatic nucleic acid molecule of any of claims 56-58, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
having an H-Ras sequence. 

63. The enzymatic nucleic acid molecule of claim 55 or claim 56, wherein said 
enzymatic nucleic acid molecule is a DNAzyme in a 10-23 configuration. 

64. The enzymatic nucleic acid molecule of claim 63, wherein said enzymatic 
nucleic acid molecule comprises a sequence complementary to a sequence of 
SEQIDNOs: 1-2328. 

65. The enzymatic nucleic acid molecule of claim 63, wherein said enzymatic 
nucleic acid molecule comprises a sequence of SEQ ID NOs: 2329-4655. 

66. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises between 12 and 100 bases complementary to an RNA 
having K-Ras, H-Ras and/or N-Ras sequence. 

67. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises between 14 and 24 bases complementary to an RNA 
having K-Ras, H-Ras and/or N-Ras sequence. 

68. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule is chemically synthesized. 

69. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises at least one 2'-sugar modification. 

70. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises at least one nucleic acid base modification. 

71. The nucleic acid molecule of any of claims 53-59, wherein said enzymatic 
nucleic acid molecule comprises at least one phosphate backbone 
modification. 
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72. A mammalian cell comprising the nucleic acid molecule of any of claims 
53-59. 

73. The mammalian cell of claim 72, wherein said mammalian cell is a human 
cell. 

74. A method of reducing K-Ras activity in a cell, comprising contacting said 
cell with the nucleic acid molecule of any of claims 53, 55, 57, 58 or 59, 
under conditions suitable for said reduction of K-Ras activity. 

75. A method of reducing H-Ras activity in a cell, comprising contacting said 
cell with the nucleic acid molecule of any of claims 54, 56, 57, 58 or 59, 
under conditions suitable for said reduction of H-Ras activity. 

76. A method of treatment of a subject having a condition associated with the 
level of K-Ras, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 53, 55, 57, 58 or 59, under conditions suitable 
for said treatment. 

77. A method of treatment of a subject having a condition associated with the 
level of H-Ras, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 54, 56, 57, 58 or 59, under conditions suitable 
for said treatment 

78. The method of claim 76 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 

79. The method of claim 77 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment 

80. A method of cleaving RNA having a K-Ras sequence comprising contacting 
an nucleic acid molecule of any of claims 53, 55, 57, 58 or 59, with said 
RNA under conditions suitable for the cleavage. 

81. A method of cleaving RNA having a H-Ras sequence comprising contacting 
an nucleic acid molecule of any of claims 54, 56, 57, 58 or 59, with said 
RNA under conditions suitable for the cleavage. 

82. The method of claim 80, wherein said cleavage is carried out in the presence 
of a divalent cation. 
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83. The method of claim 81, wherein said cleavage is carried out in the presence 
of a divalent cation. 

84. The method of claim 82, wherein said divalent cation is Mg2+. 

85. The method of claim 83, wherein said divalent cation is Mg2+- 

86. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises a cap structure, wherein the cap structure is at the 5'- 
end, 3'-end, or both the 5'-end and the 3'-end of said nucleic acid molecule. 

87. The nucleic acid molecule of claim 86, wherein the cap structure comprises 
a 3 ',3 '-linked or 5 ',5 '-linked deoxyabasic ribose derivative. 

88. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 53-59 in a manner that allows 
expression of the nucleic acid molecule. 

89. A mammalian cell comprising an expression vector of claim 88. 

90. The mammalian cell of claim 89, wherein said mammalian cell is a human 
cell. 

91. The expression vector of claim 88, wherein said nucleic acid molecule is in a 
DNAzyme configuration. 

92. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to a 
nucleic acid molecule having a K-Ras sequence. 

93. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to a 
nucleic acid molecule having a H-Ras sequence. 

94. The expression vector of claim 88, wherein said expression vector comprises 
a nucleic acid sequence encoding two or more of said nucleic acid molecules, 
which may be the same or different. 

95. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence encoding an antisense nucleic acid molecule or siRNA 
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nucleic acid molecule complementary to a nucleic acid molecule having a K- 
Ras sequence. 

96. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence encoding an antisense nucleic acid molecule or siRNA 
nucleic acid molecule complementary to a nucleic acid molecule having a H- 
Ras sequence. 

97. A method for the treatment of cancer comprising administering to a subject 
the nucleic acid molecule of any of claims 53-59 under conditions suitable 
for said treatment. 

98. The method of claim 97, wherein said cancer is colorectal cancer. 

99. The method of claim 97, wherein said cancer is lung cancer. 

100. The method of claim 97, wherein said cancer is prostate cancer. 

101 . The method of claim 97, wherein said cancer is bladder cancer. 

102. The method of claim 97, wherein said cancer is breast cancer. 

103. The method of claim 97, wherein said cancer is pancreatic cancer. 

104. The method of claim 97, wherein said method further comprises 
administering to said patient one or more other therapies under conditions 
suitable for said treatment. 

105. The method of claim 78 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

106. The method of claim 79 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

107. The method of claim 104 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 
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108. The method of claim 105, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

109. The method of claim 106, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

110. The method of claim 107, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubin, fluorouracil carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

111. A composition comprising a nucleic acid molecule of any of claims 53-59 
and a pharmaceutical^ acceptable carrier. 

112. A method of administering to a cell a nucleic acid molecule of any of claims 
53-59 comprising contacting said cell with the enzymatic nucleic acid 
molecule under conditions suitable for said administration. 

1 13. The method of claim 112, wherein said cell is a mammalian cell. 

1 14. The method of claim 113, wherein said cell is a human cell. 

115. The method of claim 112, wherein said administration is in the presence of a 
delivery reagent. 

116. The method of claim 115, wherein said delivery reagent is a lipid. 

117. The method of claim 1 16, wherein said lipid is a cationic lipid. 

118. The method of claim 116, wherein said lipid is a phospholipid. 

1 1 9. The method of claim 1 1 5, wherein said delivery reagent is a liposome. 

120. A siRNA nucleic acid molecule which modulates expression of a nucleic 
acid molecule encoding HIV or a component of HIV. 
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121. An enzymatic nucleic acid molecule which modulates expression of a 
nucleic acid molecule encoding HIV or a component of HIV, wherein said 
enzymatic nucleic acid molecule is in an Inozyme, G-cleaver, Zinzyme or 
Amberzyme configuration. 

122. An enzymatic nucleic acid molecule comprising a sequence selected from the 
group consisting of SEQ ID NOs. 6727-6799, 

123. An enzymatic nucleic acid molecule comprising at least one binding arm 
wherein one or more of said binding arms comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID 
NOs. 6642-6726. 

124. A siRNA nucleic acid molecule comprising a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NOs. 6642-6726. 

125. The nucleic acid of any of claims 120-124, wherein said nucleic acid 
molecule is adapted to HIV infection or acquired immunodeficiency 
syndrome (AIDS). 

126. The enzymatic nucleic acid molecule of any of claims 121-123, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
having a HTV sequence. 

127. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in an Inozyme configuration. 

128. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in a Zinzyme configuration. 

129. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in a G-cleaver configuration. 

130. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in an Amberzyme configuration. 

131. The enzymatic nucleic acid molecule of claim 123, wherein said enzymatic 
nucleic acid molecule is in a DNAzyme configuration. 

132. The enzymatic nucleic acid molecule of claim 123, wherein said enzymatic 
nucleic acid molecule is in a Hammerhead configuration. 
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133. The enzymatic nucleic acid molecule of claim 127, wherein said Inozyme 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6648-6655. 

134. The enzymatic nucleic acid molecule of claim 127, wherein said Inozyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6733-6740. 

135. The enzymatic nucleic acid molecule of claim 128, wherein said Zinzyme 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6656-6663 and 6723-6726. 

136. The enzymatic nucleic acid molecule of claim 128, wherein said Zinzyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6741-6748 and 6795-6799. 

137. The enzymatic nucleic acid molecule of claim 130, wherein said Amberzyme 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6656-6688. 

138. The enzymatic nucleic acid molecule of claim 130, wherein said Amberzyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6762-6789. 

139. The enzymatic nucleic acid molecule of claim 131, wherein said DNAzyrne 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6656-6668 and 6718-6722. 

140. The enzymatic nucleic acid molecule of claim 131, wherein said DNAzyrne 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6749-6761 and 6790-6794. 

141. The enzymatic nucleic acid molecule of claim 132, wherein said 
Hammerhead comprises a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs. 6642-6647. 

142. The enzymatic nucleic acid molecule of claim 132, wherein said 
Hammerhead comprises a sequence selected from the group consisting of 
SEQ ID NOs 6727-6732. 
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143. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises between 12 and 100 bases complementary to a 
nucleic acid molecule encoding HIV. 

144. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises between 14 and 24 bases complementary to a 
nucleic acid molecule encoding HIV. 

145. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule is chemically synthesized. 

146. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises at least one 2'-sugar modification. 

147. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises at least one nucleic acid base modification. 

148; The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises at least one phosphate backbone modification. 

149. A mammalian cell comprising the nucleic acid molecule of any of claims 
120-124 

150. The mammalian cell of claim 149, wherein said mammalian cell is a human 
cell. 

151. A method of reducing HIV activity in a cell, comprising contacting said cell 
with the nucleic acid molecule of any of claims 120-124, under conditions 
suitable for said reduction of HIV activity. 

152. A method of treatment of a subject having a condition associated with the 
level of HIV, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 120-124, under conditions suitable for said 
treatment. 

153. The method of claim 151 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 



154. 



The method of claim 152 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 
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155. A method of cleaving RNA of an HTV gene comprising contacting an 
enzymatic nucleic acid molecule of any of claims 121-123 with said RNA of 
a HIV gene under conditions suitable for the cleavage. 

156. The method of claim 155, wherein said cleavage is carried out in the 
presence of a divalent cation. 

1 57. The method of claim 1 56, wherein said divalent cation is Mg 2+ . 

158. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises a cap structure, wherein the cap structure is at the 
5'-end, 3'-end, or both the 5'-end and the 3'-end of said nucleic acid 
molecule. 

159. The nucleic acid molecule of claim 158, wherein the cap structure at the 5'- 
end, 3 '-end, or both the 5' -end and the 3 '-end comprises a 3 ',3' -linked or 
5',5'-linked deoxyabasic ribose derivative. 

160. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 120-124 in a manner which 
allows expression of the nucleic acid molecule. 

161. A mammalian cell comprising an expression vector of claim 160. 

162. The mammalian cell of claim 161, wherein said mammalian cell is a human 
cell. 

163. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 122 or 123 in a manner which 
allows expression of the nucleic acid molecule, wherein said nucleic acid 
molecule is in a hammerhead configuration. 

164. The expression vector of claim 160, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to the 
RNA of HIV. 

165. The expression vector of claim 160, wherein said expression vector 
comprises a nucleic acid sequence encoding two or more of said nucleic acid 
molecules, which may be the same or different. 
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166. The expression vector of claim 165, wherein said expression vector further 
comprises a sequence encoding a siKNA nucleic acid molecule 
complementary to the RNA of HIV gene. 

167. A method for treatment of acquired immunodeficiency syndrome (AIDS) or 
an AIDS related condition comprising administering to a subject the nucleic 
acid molecule of any of claims 120-124 under conditions suitable for said 
treatment. 

168. The method of claim 167, wherein said AIDS related condition is Kaposi's 
sarcoma, lymphoma, cervical cancer, squamous cell carcinoma, cardiac 
myopathy, rheumatic disease, or opportunistic infection. 

169. The method of claim 167, wherein said method further comprises 
administering to said subject one or more other therapies. 

170. The nucleic acid molecule of claim 121 or claim 123, wherein said nucleic 
acid molecule comprises at least five ribose residues, at least ten 2-0-methyl 
modifications, and a 3*- end modification. 

171. The nucleic acid molecule of claim 170, wherein said nucleic acid molecule 
further comprises phosphorothioate linkages on at least three of the 5' 
terminal nucleotides. 

172. The nucleic acid molecule of claim 170, wherein said 3'- end modification is 
a 3 '-3' inverted abasic moiety. 

173. The method of claim 153 wherein said other drug therapies chosen from 
antiviral therapy, monoclonal antibody therapy, chemotherapy, radiation 
therapy, analgesic therapy, and anti-inflammatory therapy. 

174. The method of claim 173, wherein said antiviral therapy is chosen from 
treatment with AZT, ddC, ddl, d4T, 3TC, Ribavirin, delvaridine, nevirapine, 
efravirenz, ritonavir, saquinavir, indinavir, amprenivir, nelfinavir, and 
lopinavir. 

175. The method of claim 154 wherein said other drug therapies are chosen from 
antiviral therapy, monoclonal antibody therapy, chemotherapy, radiation 
therapy, analgesic therapy, and anti-inflammatory therapy. 
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176. The method of claim 175, wherein said antiviral therapy is chosen from 
treatment with AZT, ddC, ddl, d4T, 3TC, Ribavirin, delvaridine, nevirapine, 
efravirenz, ritonavir, saquinivir, indinavir, amprenivir, nelfinavir, and 
lopinavir. 

177. The method of claim 169 wherein said other drug therapies are chosen from 
antiviral therapy, monoclonal antibody therapy, chemotherapy, radiation 
therapy, analgesic therapy, and anti-inflammatory therapy. 

178. The method of claim 177, wherein said antiviral therapy is chosen from 
treatment with AZT, ddC, ddl, d4T, 3TC, Ribavirin, delvaridine, nevirapine, 
efravirenz, ritonavir, saquinivir, indinavir, amprenivir, nelfinavir, and 
lopinavir. 

179. A pharmaceutical composition comprising a nucleic acid molecule of any of 
claims 120-124 in a pharmaceutical^ acceptable carrier. 

180. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is nef. 

181. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIVisvif. 

182. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is tat. 

183. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is rev. 

184. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV isLTR. 

185. The nucleic acid molecule of claim 1 84, wherein said LTR is the 3 '-LTR. 

186. The nucleic acid molecule of claim 184, wherein said LTR is the 5'-LTR. 

1 87. A method of administering to a cell a nucleic acid molecule of any of claims 
120-124 comprising contacting said cell with the nucleic acid molecule 
under conditions suitable for said administration. 

188. The method of claim 187, wherein said cell is a mammalian cell. 
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1 89. The method of claim 1 87, wherein said cell is a human cell. 

190. The method of claim 187, wherein said administration is in the presence of a 
delivery reagent. 

191. The method of claim 190, wherein said delivery reagent is a lipid. 

192. The method of claim 191, wherein said lipid is a cationic lipid. 

193. The method of claim 191, wherein said lipid is a phospholipid. 

1 94. The method of claim 1 90, wherein said delivery reagent is a liposome. 
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